(51)lnt.CI. 



* ' ' Page 1 of 2 

PATENT ABSTRACTS OF JAPAN 1 o/ g2s > 62Z 

(11)Publication number : 11-122116 
(43)Date of publication of application : 30.04.1999 



H03M 7/40 
G06F 17/14 
H03M 7/30 
H04N 1/41 
H04N 7/30 



(21)Application number : 10-169417 



(22)Date of filing : 



30.04.1998 



(71) Applicant : CANON INF SYST RES AUSTRALIA 

PTY LTD 
CANON INC 

(72) lnventor : PROKOP TOMASZ THOMAS 

ELBOURNE TREVOR ROBERT 
PULVER MARK 



(30)Priority 

Priority number : 97 6486 Priority date : 30.04.1997 Priority country : AU 
97 6484 30.04.1997 

AU 



mi 

T 



(54) DEVICE AND METHOD FOR COMPRESSION 

(57)Abstract: 

PROBLEM TO BE SOLVED: To shorten the time needed for 
operation and to improve the performance of DCT(discrete 
cosine transformation) or reverse DCT by equipping a DCT 
device with a transposed matrix storage means and an 
arithmetic circuit consisting of a combinational circuit for 
performing DCT operation without using any clocked storage 
means. 

SOLUTION: The substitute memory 1 1 18 of the DCT 
transformation part transform column type data into row type 
data so as to implement 2nd pass of two-dimensional discrete 
cosine transformation. Data from an input circuit 1 126 and the 
substitute memory 1 1 18 are multiplexed by a multiplexer 1 124 
and sent to a mathematical circuit 1 122. The result of the 
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mathematical circuit 1 122 is sent to an output circuit 1 120 after the 2nd pass ends. A control circuit 
1116 controls a stream of data in the DCT transforming device. The mathematical circuit 1 122 is the 
combinational circuit which does not have a storage location where an intermediate result is stored. 
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b. o-^/l'DMA^ 

3. 17. 14 7nHWfMH* 

3. 18 7^*91^-**- KO^ai-^ 

3. is. i ep^jux-tf-rji? 

3. 18. 2 MUV^y77 

3. is. 3 Xi-J-f 

3. 18. 4 t^VK*- tf-T-Wf B, C 

3. 18. 5 ^77-^*=^ 

3. 18. 6 7-^ * t 7'>a 3 y h n-7 1 * 1 7 

a. J—rjV^-^ yi/zL^—Y 

b . k 

d. JPEGf^-K 

e. ©«JPEG«**-K 
f . -v h y * K 

g . 4 7.^—7)^— K 

h. «MMIs*-K 

3. 18. 7 AMSt— Vzz.— 7.7,4 

3. 18. 8 D-^;M^!)3yho-7 

3 . 18. 9 ^rCDtetf)^— K 

3. 18. 10 «^y^-7i-X = yha-7 

3. 18. 11 ^-7i-Xny ha-7 
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r-7>3f3l 
f— 7*)\/ 1 

7—7*^2 
r-7>3 
7—7*^4 
=f— 7* A 5 
7~ — 7VU6 

7— 7VU9 
T—Zf/V 1 0 
f-^H 1 
-r-TVH 2 
T—Zf/V 1 2 A 
3 : 

♦fts 

X— 7> 1 4 : 



mmf-99^7° 

SOGCSt-KfT KWX-g-t 



/>77 y,£.tfi:T-{t7 1 -7VW8/'< v ^ 

K— -7— Sh^a- b'7 4 



T—llV 1 5 
T-7VH 6 
7 1 — ^1 7 

X-7VH 8 

T—7VH 9 
-/UK 
f-—-f;V2 o 

7 1 — 7>2 1 
2 

7— 7VV2 3 
7— 7*^2 4 
7V>2 5 
-A-K 
r— TVi^ 6 

M4fetfKl«L0>£ife0!l<0R9lj »»4*»«"ett» 

ztx±%tm&t>mbtix^?> 0 ^o-c, m-<o4Mt 
ihh** h y -A^^-^cowjip^^-rs r. t 

[0 0 2 5] #j£fcHi60y-era:, 2^* h y-A%ffl 
*xh 7 by-A.&m^Zzblz.i.&WM.tf 



ra^-^, /<v k, ^ h y y^if ^wj^aicis^ 

3. 1 h 9-A7-*r^f tOHRi* 

Hill, #iifc**«&*tP = :'tfa-*/N-H?*T 

Us 7*y y^2 0 4S:^LT*X HEtt^^y 2 0 3 
g^tlfc** hCPU2 0 2 frbf&ZM&*7 hay 

77*y -r— fa>\ It^x^^^W^ifcTJ-^ay 
—9 -y^-r-MlPC I /<x^y?7x-7 2 0 7£;frL 

-csi*ppc 1^*2 o emssttsnr^*. &*5, pc 

?X (ffifiK) ^l/-7^^7i»«UV^ 
i/XTM£\t, PCls<7 2 0 6tfffit>oX^Z> 0 PC 
1/^*2 0 6 %%^5Zb\Cj:>Q s pci/Wy^7 
x— *2 1 0 s teCDT'/W 7. 2 1 I, o— ^;M*y 2 
1 2 frJfMEC-Sftf lo^nm&WP C I K («*. 

11209) &mi&2o iizttMLxmm-rzzb&®M, 

[0 0 2 6] »a*H160!-CW:, ^-5?e3*Wfi-t?*S 
Sftfc^77-f y**^«rW£lcrsfc*fc, 7*? 
■*7n7^*-K220*f«. 779M& 
T9 J £7V—913—Y (PCI^y^7i-72 2 
1 lis flil©PCI*-K2 0 9^iri:P1SIC 

hCPU2 0 2 til, i^^^^^nfc**/* 
y<7?migt?»^-t-5J:5lCsS:ff$^T^5, /<C*5s £>S 
Xhixfi, mW79-t7^— 9U— K2 2 0^Jg{i:^^ 

h3yt°a.-^->^7"A(cftAP-?-5r i t)-C#5 0 5* 
* w^r ^ -fe 7 v- 9 U - Kfls 7*9 W^^atb^lc: 

Xh<0. -frbnthfth LTIls 
(a) 

(b) -jRftfe^raaEft 

(c) JPEGtfft/If 

(d) ^7?y, 7yi/y^, ttmmt/'&'ik 

(e) RSaftHil ffi^ 

(f) -mtT7j>mmm 

(g) /h*-*/Ha«JI (ay^-^ay) 

(h) nnmm 

(i ) h-y^a 

(j ) HSS#*/* 

7^?It7?t7^*-K2 2 OliHlCyTs^ii 
^=7 B a-tryf-2 2 4K:SBRS*tfcB-*A'> j ey 2 2 
3^x.s 7**®&=>7'ci-t?;/-y-2 2 4(1*^. hCP 
U2 0 2frb<Vfis-TtKg;<5\,^X 7 7 9W&T 9 ± 7 
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*#-K2 2 OSrjElWS. 3^nt5-t22 
414»J&J83i|6jttLS I (ASIC) -C$>5-t^S* 

&t£< ttlo©y!Jy^-f^X2 2 6«rJSia-f > 

(c, iS^T^-ir7^-^*-K2 2 0I4, *dr-v7-fti* 

ttX, T9±y V— 9 fs-Y 22 Q>\Z.\ty7,^m^ 

7,2 2 7tfffiz.tbhXt$<0. Tr-trDyy^-TXYZ'fT 
0Zbi>X~%Z> o o 

[00 27] YXfi, FCPU20 2^P 

C I^2 0 6*^*UTHI»**'^f-**aSflrU 
5X^It3yBt 3 't2 2 4fItro»i§tT 
5o i@Ht ^tifc-r-^fip— */M-=ey 2 2 3ro%?> 

-f y^.^H^=>7 P n-t-y^2 2 4 tp©df-y y 2 3 
0 X fe5VN«=i7 p n-fe ; /f-2 2 4*P©U^;** 2 2 9fc 

[00 28] 08 2tt, 9^Bt = /Ptj't224t 
i9^a{C^LfclHT-S>5 0 =ynt?t2 2 41*. Ml 

3 5©IM»TK:*)51i«©«MtA»6«ja$*v5. 

223 kmm-rz>tz£><vv-j}/i<**}) urn® 2 36* 
jwtL-o^o mm-i>?7=--xfflffl&2 3 7ii % 7° 

AH" 5. J^ia-f ^^7*-^«0#|l|B2 3 7tta-*/^ 

*U«fPfB5 2 3 6 £rt*B88Ri5Fft-CV*5. 

y »J»Si5 2 3 6 t tMSM y? 7 2 38ttt 

W)^?7x-7^ y^2 5 2Sr^-LT^$tvT 

*5«5 % ATM y?7x-^^ 5 'f2 5 2it#^ffiHPft 
2 3 5 b&ifcZinX^Zo A^v?7i-^Myf- 
2 5 2tt*fctf*-fc/»':a-— tf-Mif 2 4 6 if*-*** 
r^HM2 4 0lr^$*VCV>5. AT^^*?* 
2 5 2li, im4 % sjry*-7.ljffi&2S 
7 ia-tf/M^yfMfl?^ 3 6frb<D*r—?*xJ'y 

«240« -fcvUtf— Jtf-*--r If 2 4 6 iCfe&fZlt 
£><Dh<DX'&% 0 

[0 0 2 9] ^{»>f ^7*-^«HBSS2 3 8tt, Hi 

4>©pc i/<^2 o 6 tiifl-f-Sfcfelcy^^Pf^PT 5 

D-feyf-2 2 4«f»K#WH3;h,T*J»>, #^ffifPA2 3 5 

»2 3 9fcSiai$n» 37*BtyfZ 2 4 tffi^LTKl 
f^1-5ffi ; ev J ^-^2 3 9*5{ifc.oTV>5„ 
[0 0 3 0] r-^*tr^2 3 Ote, S^SfvCV 



5f_ ^ ty «> a 2 3 OteffiAWffi&k^Vv-cfflV^ft 
n^n-fe^iJ-2 2 4 fc^TSI ##c#&/H SftS 

P>H2>„ iiEroifSiiffc&Ste, itLtJPEGfffft 
/f«2 4 ^7 f -*^Xg|S24 2{CioT^ 

&©7*-** h y-A©*aa#tTfe*t5, see 2 4 1 , 

2 4 2fctM?'Ji-f ^-tr/W:*-— tfi-J-f 2 4 6 2o©t 
^VKt-Wf 2 4 7, 2 4 8Kl«IR3*Vt^ 
5 0 3&241, 2 4 2^?)0«ii$}lfc^H)-A 
tt, 4 9JC|ej£$*U ^Ht?fetb« 

2 3 0fcfll*.T, t°^-fe/^-^i-^-!f2 4 6 fcifeSM- 

-Wf 2 4 9t©l3i:vyvf3-^ hffi (muv) 

/^7r2 5 0Srfx.t^5„ g^-^ft'f f 2 4 9 

iS&S'Cfc^tf^'O^^ai— ^«»« 

2 3 8, c— *A'^*yiWWaJ2 3 6, ^ 7 * 

~-xmffln2 3 7\mt>£tiz>o 

[0 0 3 1] H24 J w^'C*$ti-Cv>5<t5tc > $e> 
t£Z> («3(0) x-^^SFI52 4 3Sr, JPEGfftft 
/«-^§2 4 1 i^^x-?^*^ 4 2tt^fci 

fcs. rafcsw±**t;w±©x-*^*&pj&i- 
s^t trout wc*>a. '<*n wm^i & 

«fic»ffi-5 t (rttE^ti 

[0032] E2©AS I C<D±ftfflim*&>T<Dl:ot£ 

Lfc t lt t> Ara© s trr*«*n r t tt*v 
y v * A *&ib lt v n§ ng k a v ^wjroaHfcas 

t>/>- K7i7»ilIttl;i577'D-f(0^i L 
V\ 

[0033] ^2ic mwteasrHfT-rsotc^iift* 

7?7U #^yy^rfcK#«i:i-*/N-K9*T<ftt 
-Mcj:o-C*«ttk:ll!ll»**iS. i-*fct>, »fls*tf 
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[0 0 3 4] rtUdttLT, ^©ASICOff'fV 

(b) P^^r$n5ritl*v\ o 
HtdS^vNTv>5„ r confix -wtim&i- z> 

[0 0 3 5] ZOkottM&fab, V^<^fy7' 

[0 0 3 6] Sfc, **©f*~^«Wl/^tKrJ5i, 
4itoJ;^jPEGm/«t 1 fcS^fiiMBft 

[oo3 7] jeic, «y®&it«> -cosmic, «.a«wt-» 

»*<D«»J©f*-*«:tIiWb, ift—rs JE 
■tfc*»fc*WW-»£fc lot, *f-*<Dy 

[00 38] ' 3 7 , Dt 5 't224 

i3*^hCPU2 0 2 (HI) ©M#P©TT?*ff**t<5 

yf-2 2 4±«c©W«r*E%-r5. 2 3 5 

tt, CBus (C'<*) t!^«tl'5$lJ^^2 3 KCt 
ot3^nir-7t2 2 4 SrlW^-ttS,, CBus 23 1 
i*Ztl?tl<D^i?=>.~/l''£<D±y (il©2 
3 1) Sr^tiHrv^-A^ 3 6-2 5 0 <0-t*V^*tfc8S 

5. 0 2^m^-r<1-Sfc*IC, H 2 T?ttlW»^ 2 3 
1 *»6*|l4P*W!)*^a— ^2 3 6-2 5 0 4T*©^M 

[0 0 3 9] H3f4» £J/SWt6fc*^»--A'l'^**0> 
DESftft W 7 9 h 2 6 0 t/T LfcHtfci. 

h2 6 0f4, a/nt 5 't2 2 4 ©£#IW*|l©fcfc© 
^^2 6 1 fc*4MffflS2 3 5i:^*n5, 



tyt^a-;v23 6-2 6 0i:tt, Wfanvy?.* 
2 6 2*5-g^tl/5c 

3. 2 h/s^ctyt • *a — (W 

2 tmfca:AHry*2 0 4 fc©We©16BI;6S+#fci: 

[0 0 4 0] SftOa^a-i'WrAtt, 

a^P-fer-yy-t -t^m^-jlco^^yi*** 

j)M * y (D^ffi «r**.fc«fc* * y aMWJc**!* r. t Sr 
«BE1" * ft: » t , C P U i a * y f - i O W ttflT*» 

[00 4 1] &&}\c&\<9mxtbhft**V fcfflv\fctf, 

mc^xis< ^ th^mxhi>a>K mm*&\ ±# 

[ 0 0 4 2 ] «tttlB#wffi{r, r^ip^fA-c 

[0 04 3] r:*t?)©nH*»*-t-Sfc»K: % wfA 
i"6PI»l««^i6«?-C*>S. H4K, (*^. h) CPU 
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[0 0 4 4] SUlC^T, CPU2 0 2li^rA4> 
0D-f'<T<Op«^!J^SSr^LT^-5„ CPU2 0 2 

i#, *><5V*tt:a:/n-feyy-2 2 4 "Ccofljffl^fcft 
fc, ^y 2 0 3 SrW"9 3TS„ 3/0*^2 2 411 

7 e o-feri/f-2 0 2 t^LTV^^^'J 2 0 3H*b$c$ 

1 0 2 2^Tt5rti5f§5„ rixfeW^O-ttl 
«lttlS* 1 0 2 4 3r#W* *!)20 3 lc»#3itf ^ t 

2 3^11 0 2 4SrlS1t1-5tCg1-5^*y 20 3<D 
[00 45] CPU20 2lt a/otyt2 2 4|£j: 

orssffjns^i o 2 2 ftys^fctr 5. 

CPU 2 0 2 t3^nt5't2 2 4 £(DNJ<03fe?lJtt&ft 
^fc-fSfcfelc:, CPU2 0 2(aotM§ilfe^ 
»4l o 2 2tc^$^5 i 5C*=- -tyrstiXfrb* 
•7°X2±y J f2 2 4l£*5V>T|l : ?T£tl5o ^-1022 
f0>£lfrfrtt, 3/nt5't2 2 4(Ofc)timbCP 
U2 0 2(c4o-C#l 9 %X e>ftfc#W^y 2 0 3*© 
VK1 02 3-^* 1 024 *&mt% Zb&X 

[00 46] H5|c^-t-J; ^.tih^m^lotz 
#lc x **£j&ttl 0 30 v ^*!)fiai0 31, ^ 

»Hffa»i o 3 2 #£8**1/0* a. r^-r^-cco 

ct^a— /M4*:* hCPU2 0 2lt|-7 , nt^tL 
THfT$H5„ a^tyt 2 24 IC*5tt52&fi ! *M*tt 
1 0 3 0.il*3V*T£j?fc£*t, **yfFa»l 
0 3 1 Of— fcr^«rflJffiUT^$itfc^<©^7^ 
Kl 0 2 3*tt*l 0 2 4CDfc#c0fg4££S)jyy|T£ o 
*fc, lft^£j&£|5l 0 3 0f4, ^-fa&l 0 3 2<D 
•9— tf^^Jfflbt, 37 e n^ S ;f-2 2 4-C^tt-t5# 

[0 0 4 7] T'n -fey? 2 2 4Rl*JV^T*ff 

c pu 2 o 2tt^-=&y§a»i o 3 nao 

^^KtJ&S^itnTtl-Cfct), ^©^T-CPUKJ;-? 
-C^^y^Jfe^tbS. = :/n-fcy<y-2 2 4sWIM* , fcll* 

x.5 £ ®m\zj y^y-fv ft^£&W l / * ]) Zteiki- 

•5<£>04&<, a:/P*y1f2 2 4fl$fe^*|**.fcfW> 

&igm$ftztii&\*, y*y£aau 0 3 1 t^-t 
asp 1 0 3 2 1 <DvmkfcftLx& <o , ^jffl^rtB*^ 
7 1 a ^ * y s^nty inih* k# * * y aic 

[004 8] 06ft, 3^u-feyf^*a-10 22 



U 2 0 2{dJ;9^V7 f ^ 1 0 4 0 (C^fA 

3/Ptyt2 2 4.Cj.oT«E*tH$ft£fTlC# 

$tt-s„ 3yntyt2 2 4te#tt33fc*?ft.aas*'rr 

2>£, Mtt?!)-V7?^-10 4 ltCSS 

[0 0 4 9] -fft^r:x-l 0 2 2 g#(4@£$>-5VM4i!) 

3.-IO 2 2f4, CPU 2 0 2tCi5^<D4^t='7' 
airyf-2 2 4te*Jtt*#*05*fT£«:5MttLO^. 

0 3 o*»e>©K#icjtscTy*y*a»i 0 3 

1 (05) {r«to-cf>j<?srt)ti5. %jv<<k&£tiit 

ffil 0 3 1 fc#»—*a«Bi 0 3 2 i<o&PKif££|e»> 

g IbWlcawS"?* 3 J: 5 LO*5. 
[0 0 5 0] #^3.— faSPl 0 2tt, a^o-fey-y- 

2 2 4«s*^J*Bl 0 3 0fcJ.oT£j*$*lfc4H*$r 

^-yiras&i 0 3 iicio"c«9S-ce>ix5*^*a- 

1 0 2 2 i^y 2 0 3riS-h9-*#("t*U4, 3^ntj/ 
■y-2 2 4££<#o ! ftg;5S;fe^;!>\ M4'>&< kt> 

3o ^<d«f^t% ^roL-^aifBi 0 3 2 ty^ey wa^p 

10 3 1 t©RJlM6««j*»f^#H*&$ft5. 
[0 0 5 1] *a»l 0 3 2(rtoT, 

S"C*fr*v\ p< * y f asc 1031 aswffl^rtBft^ 
y *s/> * < * y 00 fe 5 , § v n \tt£ < * o it r. i Srti 

WUfc^rtt, *a-fl»1 0 3 2W!)-y7y 
^*&aSr*S*L,> a^o-fey-y-2 2 4 J: o T t> tt^ 

±y> >^y taifpi o 3 in, cpu20 2^ 3 yp 

■fe y-y-2 2 4<£r^FO, fcSVM4=J7 , nir 5 /-y-2 2 4 

Siifc^lcKf *^*y R*Sr»JS$*5ii:^"C# 
3. 

[0052] ^^ytagpi o 3 ia>b*a-siai 

0 3 2{C^T^^^ y-^Ty^-r 5if**aLT 

t-, ##£f*»©i*u«#&»fc-rfcj&a+#* j ey 
di«iifc$*tft^ofcjs^Ktt, ^ * y fasc 1 o 3 1 14 
*^—<gmn 1 0 3 2^^<v7*w y^^fi- 1 0 4 
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^(D^<oi)^m7ir 5*tCPU2 0 29SM\ffn 
-oti^mT-tZt. c\ftb<D^<DJr^yVYtmi&$: 

ft. %*zm±-r\c+tt%**})m§bft5>o urn^n 

©^li'<>7 ! -f>'^ :J E-*-l 0 4 0t|fftLtte 

5. CPU2 0,2A5#lf^7-fy/ 

^dr=.- 1 0 4 0 *<D-&Z? V ->T yf-f^ C\ t 
icitK ^^DlWlOSlttot+^y*!)^ 

#^^sci o 3 oom^mtz-tz. tit-e 

[0 0 5 3] n;/n-l?5/lJ-2 2 4^yf-f V^^-O 

^/ci-fcltcoy ^& V tmiikZftftfritz b ^ 5 

— ^.(D#a-^f^ ^ y 139S 10 3 1 Uf'itW^y 

5 J: 3 t ft 3J>o«Jt« v 5 a -7*ti. b°*m^ 

t\ :ni:J: 9**£j*ttl o 3 o ©S**«tfc"rtc+ 

[00 54] ;icQ.fc 5 ft^y 1=3951 0 3 1 i**- 
13951 0 3 2i©tap»m£J:9s v^r a 
ftfc^y 1:2 o ^^-Q^m^^—zfy t-&9t± 

$cD ! &2H4fl'>ft < ft 9 , J: 9 *t ft*A"-y v Y 

37»n-feyiJ-2 2 4/^U^^y 2 0 3§r^oT<73«l! 

a^fc5*T?#«|-r-5ri:^#<ft«9, fiJfflWffift^ 

* y *J/>ft < -cbttf^tti-s kcxovm\t£ik1rZ. 
[0 0 5 5] ifo<fr£j£95i 0 3 o*»boH3fc£?Sfc-f-^ 

K^y 13951 0 3 1 sws^^r/T'SreiTK:* 

* y 1=395 1031 #S**j»fc1-K:+£4 * * y 2 0 
3iWWi>frZ SiWi^S. +#ft*^-y;i s #£>tu5 
»£lc ft* **y7fc*rM-«. 
#e>ftft*>ofci§&lcti:, j/^icit^ g#£ 
Wfc-T^ i 9 iiS(ftfeaiCiitf 0 

1 . fijffl^rtgft^ * y 2 0 3 x-gxzmc-r z t 

2. 
3. 
4. 



ft*5> gjjt«r»fc-rfc»lc, ^yr^v^»5*>© 
Hft595# (W*.tf, 1/3^2/3) 

[0 0 5 6] i7 (C&^T, ;>< * y 1=395 103U* 
a— 13951 0 3 2 t©Kt?©tSPttf^lC*Dx.T, @£ 
f#^^y77 1 0 5 O^nfc^-tdtt^a- 
13951 0 3 2i5aytityt2 2 4 £ i S Zl i 

y^^a- 1 0 4 0IJ^ 1 OfiCO^^-CD^ra- 

ittSifta. 2>:(r3 7 ,> c-te->f-2 2 4tA*$n5^ti 
[0 05 7] «tt#8&h,fc»£-|Ctt, * a -1395 1 0 

Ktt* 3.-13951 0 3 2lc£o-CifA$;ft5£rLVMft 
^fc&SSft+#ft£#a s #;S$c£*tS. *rU*H*«r**- 

i^-'J y^-t*5^<Odr3.-13951 0 3 2 

1. 1^*^-1 0 4 0!c +^W«oT^5*>f 

;* M-5 

2. Tl^fcV^fr&tt, a^BtyfiS 

3. «fu^**a-te#Ai-« 

3951 0 3 2<D«jfm£lT©»9T-fc5o 
1. ^ ! (*TUfct3yPtyt224i>^^ 

1 . fisft*^? > K l 0 2 3 tc&Kfc* * 'J &* * y t 
3951 0 3 KCg#-f5 

2. eai-rs**sr*ia-*-5 

3. a^cir ^^£^-13951 0 3 2KiEj&L 
J^_bcoSjf£:/ p -fc * SrSIM a - KOJ^-C* Lfc0!l £«T 

[0 0 5 8] 



y^^-<D-95^Ti-5©Sr#o 

y^y 13 

AL LOCATE MEMORY 

BEGIN 

i f mjk%mz.i-<Dic+ftft**vi>mbft%^b-r5b 

THEN ^Tbfc^-T^TSr^ y->-Ts/7° (-#) i"5 
END I F 



-li- 



THEN WAI T_FOR_I NSTRUCT I ON Wffl 
END I F 

I F g#£*»fc1-©K-B*fc > * !J btiK^k iri, t 

THEN ^7-&tiiJ3LmZ> 

end if mv^xtc**)) zm-f 

S CHEDU L E I NSTRUCT I ON 

BEGIN 

if ^**.~\c+#te&mmbtite^b-rz>b 

THEN fc5g»f^o^Sr=i7 ,, D-fe5/f-^»ITi-5*-e#«i-?. 

END IF $frUvfr^£3r=.-(£#ip-f 5 
END 

WA I T_FOR_I NSTRUCT I ON (i) 
BEGIN 

WH I LE m~TV,X^Z>h<D(D? V —VT v T^ftTV^feV m$H*# 
DO 

then ^y-yry T'a^^^^w-t- 

ENDIF *»-a»b*7 LTztisit&timi-Z 

DONE 

END 

G ENE RAT E_I NSTRUCT I ONS 
BEGIN 

ALLOCATE_MEMORYWtliU ^t^7yKl:M* 

SCHEDULE I NSTRUCT I ON SrPftW a^ntyf 

END 

• [00 60] 

[00 59] 112 tp<D^ t J*.—M£ttLX. SUi^T" [ilA] 
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eic cfg 


Cdnfig2 




eic stat 


Status 




cic err iot 


Interrupt 






Coafig2 


x >»— fc«fct£M DjM^ :7;v 


cic test 


Config2 ; 




cic_gcn_pob 


Config2 




eic high_addr 


Configl 


~S7 H Uyt-iK £ K 


eic_wtlb_v 


C6ntrol2 


TLB Invalidate/Write JB{Eg7 H fcl&ft 5 If 

•yV 


eic_wtlb_p 


Config2 


TLB Write ffl 10*37 HVXiftlffiPfcf-y h 


eic_mmu_v 


Status 


Most Recent MMU A5G87 H VZ. 


eic mmu_v 


Status 


MMU T7x-7^$tlfcMofl Recent 5? 


| eic_.p_addr 


Status 


PO Bus \Z70 izX'f 5 fcJiXO Most Recent 
lBusJ8&S7b*UX ! 


| etcrpaddr 


Status 


PC3 Bus \Z7 iJ-tT&Ztzlb® Most Recent 
RBus fflwS7 r 


eicjg^addr 


Status 


tgfam Bus 7 ^-irX-TSitto© Most Recent 
IBus/B7l ; t/-7x 


eic_Tg,_dala 


Status 


Bus \Z7 9tt-i-Z»tttt)0> Most Recent 
RBu 5 ffl7HUX | 






VLW 




lrni_cfg 


Control 


tm m ^ y 7 ^ y i/ - is a y v vy\ * 


lmi sts 


Status 




| lmi crr_,int 


Interrupt 




lmi err int en 


Control2 


x^-Rtf-f >^97h-f ^-^H/yX? 


lmi_dcfg 


Qmtrol2 


DRAM 3>7Y7t — v'a^v^? 


lmi_mode 


Control 


SDRAM t-Hl/y^J' 
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J9ZK >*-7x-X3 > 1'S*** 


picjrfg 


Config2 




pic_stal 


Status 




pic errjnt 


lmenupt 




pic err int cn 


Conftg2 




ptc_abus_cfg 


Goolroft 




pic^ab us ad dr 


Gonfigl 


ABustraiisfcrfflffli67Fl'^ J 


pic ccm_cfg 


Ooalrol2 


Centronics >7 4 V U— i/ 3 > t»W 


ptcoent^ dir 


Config2 


Centronics tf>^ h • 3 > h U-)VV> 


pic_rcverse_cfg 


Contml2 




picjimecO 


Configl 




pic_timerl 


Canfigl 






I VVTs** 1 




mmjcfg 


Corifig2 




mm slat 


Status 




mrn_crr_int 


Interrupt 




mm err int co 


Conrig2 




mrn_gefg 


Config2 




mm_diag 


Cortfig 




mm__grst 


Canfig 




mm^gcit 


Config2 




mm j»exp 


Canfig2 




mm^gint 


Config2 




mm active 


Status 





[00 6 2] [SIC] 
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9<7 






ic_cfg j 


Cbnfig? 




ic_stat 


Status/ 
Interrupt 


Xt~97.Vz?7s9 


ic^crrjnt 


Interrupt 










ic_err_int_en . 


Config2 




| ic_ipa 


Control 1 




jj ic_tda 


Configl 


X HJ— ATodo ^>»X^ 


ic fna 


G>a troll 


XhU-AfcTUv'X* 


ic inta 


Configl 


xMj-Afak*u^x* 


icjoa 


Status 


Last Overlapped Instruction Sequence (fif£ 












XMJ-A 


ic_ipb 


Control 1 




icjdb 


Configl 


B Xh'J — A Todo 


ic fnb 


Contrail 


B XM-AteTt^X? ! 


ic intb 


Configl 


B ^hy-ASk*^** 


icjob 


Status 


Last Overlapped Instruction Sequence 












BXb'J-A 


ic scma 


Status 


A XMJ-A-fcV7* 


ic_semb 


Status 


B XMJ-A-fev:/:* 






9^ 


sua 






dcc_cfel 


Confxg2 




dec stat 


Status 




dccjerrjnt 


Status 


DCC X7-XT^^XVyX^ 


dcc_err_im_cn 


Control 1 


DCC Mii*'r**-^Hiy h 


dec_cfg2 


Controi2 


DCC Zl>7 4 {f\s-z/3>2 K/V7.9 


dcc_addr 


Configl 


H Hffl^X7 K l-X k>>X 

9 


dec IvO 


Control 1 


5^>0— 31 ffl<0 "valid" tfy hXr— 


dec !vl 


Control 1 




dec lv2 


Control I 


5-f> 64—95 Jfl^ValkT tf*y hXr^^X 


dccjv3 


Contrail 


> 96—127 rUCO'vand* t'y hX? — J? 

X 


dec raddib 


Status 


> F^-xw^Mif b K i-x 


dec raddrc 


Status 





10 0 6 3] [ilD] 



-15- 





9*7 \ Wl 


dec test 


Contrail | DOC W h I'S**^ 






no pfxr 


Conf^g? 




po stal 


Status 




no pit inl 


Interrupt 




no err tnf en 


Config2 




nn Hmr 

l/VF U 1 1X1 


Config2 




JJU DUUul 


Coafig2 






Status 




po ten 


Contrail 




po s&id 


Con troll 




po idr 


Control 




po_muv_valid 


Control 


MUV fiSJHv h 


pc muv 


Configl 


MUV RAM <0^-X TFUX 


[«1E] 






oobjrig 


Coufig2 




oob stat 


Status 




oubcrrint 


Interrupt 


x^-yaaa*^** i 


oob_ciTjnt_cn 


Config2 




oob_dmr 


Oonfig2 




oob subst 


Config2 




oob_cdp 


Status 




oob Jen 


Contrail 




oob said 


ContiQll 




oob rile 


Contrail 






ooc_cfg 


Config2 




ooc stat 


Status 




| ooc_errjnt 


[ntercupt 




| ooc_err_int_en 


Config2 




\ ooc__dmr 


Conrxg2 




g ooc_sahsl 


Config2 




II ooc_cdp 


Status 


y<? \ 


| oocjen 


Contrail 




| ooc_said 


Control 1 


nr^ymmr^^T. 


] ooc_tfle 


Contrail 










JPEG t 




r^^cfg 


ConfTgZ 




I jc_stat 


Status 




jc_err_int 


Interrupt 


X y -Rtf&l&frX T~9Wi?^5 | 


jc cirjnt_en 


Config2 




jc rsi 


Configl 


■jx^-mbhb 


jc decode ^_ ( 


Control! 




jc^res 


Contrail 




jc tablcjscl 


Contral2 





[00 6 5] [SlF] 
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mdpjtfg 


Conngz 


zr yv A " v y 


mdp_&tat 


Status 




mdp_errjnt 


Interrupt 




mdp_CTT_mtjGn 


Config2 


w- — /fnlll It V -V , . II, 


mdpjtest 


Config2 




mdpjQpl 


OoutroTC 


Saifp i 


mdp_op2 


Contrott 




mdpjwr 


Control 1 




mdpjji j 


Control 1 




mdpjjm 


Contrail 




mdp len 


Control 1 










TO Cfg 


Cbnfig2 




| ro_stat 


Status 




ro err int 


Interrupt 




ro__crr_int_en 


Config2 




ro dmr 


Config2 




ro subst 


Configl 


AgAtel^** 


ro_cdp 


Status 




ro len 


Status 




ro sa 


Configl 




ro idr 


Configl 




ro vbase 


Configl 




ro cut 


Configl 




to lent 


Configl 










A read only copy of PCI ?>7 4 Vl'—i'n 
y&Bls&T. * 0x0 * OxD t OxF CatH L 


pci_cxlemal_cfg 


Status 


'J -fe y H#. 910 5/ U 7 to ROM 6 > 


[*1G] 








Config2 




lis stat 


Status 




us_en_mt 


Interrupt 




iis_errjnt_cn 


Cbnfig2 




iis_ic_addr 


Status 




iis_doc_addr 


Status 




us_po_addr 


Status 




iis_burst 


Status 


PO, DCC & IC fr£>ort~z hfi 


iis^base^addr 


Configl 




ii$ test 


Configl 





[0 0 6 7] £ftfc©W^**1 s -e*B-^#'fe«>tt:# (a) (ic_ipaiic_i 
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©{EST Kv*4rtt*Wa. (RfflT Ku^O#MHt#fr 
ii, 3 2fc*s/ h©^-^^#***H--*S*u S'— ^ 

=t^P-feyiJ-2 2 4t*^bCPU20 2^(C*5^ 

( b ) * (ic_fna£ic_fnb) 0 

(c) To Do (ic_tdafcic_td 
b) o Ztltb<Dl/i?X?^T\-Z, (VtfZthX^ 

(d) JWyzf V (ic_intaiic_ 
intb). Z.n(b<Dl>i?X5"<T\^ -fV^Tt/hS: 

(e) -Y^yr/htfjil^*:? ( i c_s t a t. a 
pritnediic stat. b prime 

d) . ift&OUS^^Ttt, <<>9y-7Ys J&7U- 

?77h#jl ( i c_s t a t) \si?X?tp<Dm<D4y 

(f) Wt^ZT VM^-vy *T (ic_semai: 
ic_semb) „ *XhCPU202H:, =iy°u±y 

jitter &s t u<t ^ us>* * r * * o^a-^m-c t> 

nni-Z>Z.b&-VZZ<, FCPU202^tv7t 
7£A¥75^itcttl§t75*£rau SKEHfr*©^ 

3. 4 t»^M)-A7t-"T?h 

5o aim, i o©*h** b y - AHtfflA^-f * 



fc5t#l-3yntyf2 2 4© ; e^a-/V^lt 

<Dco|^Ci5JC|ltT$ti2>2o©^^ V-y-A 
S** h cpu 2 o 2_kx*W)i¥VXh^z>y7 h^ric: 

(* h y — a a) {4,(0^^ h y - a h y -a 
b) i «? tiW^ffi5feST-»f^-rSc 4M**M-A3b$ 

^fi^a— tt:** HRAM2 0 3 (Hi) 4>C>— 

2 0 3©W*'J #^lt*^hRA 
m 2 o 3 <7?4£gt ^ ^ y fflgHcmti £ ixs r. t tW*. t 
<, 7^^H^37 p ci-feyy-2 2 Y\sXti*b 

^ h R AM 2 0 3tfi(0*l-iS-t-5^aT KU^SrftS't 
5 0 rtL6«^{iNR*(c:=r7 p P-iry-y-2 2 4»n-^7 
/V^*y{C*&iW$tb5. 
(0 0 6 8] @8{i, Jfc* h RAM 2 0 3+{C^$^X 
TV^20©^ by — AAiB©7i — 7 5/ h^-flU 
T*fe5„ ^> h y— AAi:B^rtt- : e4x(07*— hJi* 

*a* hy-AtB^. by — A<D2o©^^ssi^ by 

— A 

*Xhbt><D* hy-Ais®5fetl^W-r5ri:%T-t5 
ry^7>-Knt*Vj ttKi«*«l*Sa:K:1-5Ci:t> 

*if*,f)/i^x h y-A$r Tnj/^j Lt> ^hy-A 

*if*je>^<D^. hy— AAs^-cfcoTtav 

* t'hbfixox h y— Afi, ^(0^*5 r*-/<?^ 

t"j L■cv^±5/J!4^^Sr•&^'T•v^Tt>Jl^^ 

* 3 2 e y kd 1 o-f oiiJpi-5 i 5 4 r-jt 

*##J#PW 2 3 5 3/Dt5-t2 2 4©fflE©Str 
ffiJIWSrtTpfc*^, i&g&!fi!Ffc:** FRAM2 0 3^b 
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3 5{±^«^?rtTV\ CBus23 1^lt ; ev? 2. 1*3^7-^ V* 7 7" Met *) ft^ftmUtfftit 

ti^zmnz-zzimzft *>, 3. ^y^^y^Lx^^^n^y-^m^h^E^ 

[0 0 6 9] ^$»g|S2 3 5-CHtrf *» 

•l'^Ki40©^f-^2 7 6 -2 7 9^b^5. d» 

mix?— v?2 7 6-e» s h v—Mz&^x-fo-St 5. }?hbfr<D* hV—^y-try 7^ 

ls?W:WiX*hZi&-&Wi, ^7xyf LT2 7 7, 6. ¥hhfr<DZ> V ]) -A^yf-f V^'p^LT 

m^P>U J lC||fTL2 7 8. k^^S:ISt5 2 7 l^334»iT5*» 

9„ gTFK^-f = - TO, iilB/V-McX-^Tl^ri 5 

3.5 ^&<©7*7^;/;* hy-AO*S ^TV^LTi^ri^^^-fST/i^y XA 

e>ftV> 0 SrfflV^T, ^*J1»g|52 3 5ttC^S^a«^LT 

1. ^^•^yr- ) , y^L.-Cl/^*^5^<0^ K?i7i LT*g-f 5;: 6. 

2. if<0^^ MJ-ASr^(C7is/^-f-5/!)^*3£ [0 0 7 0] 

i f 319-^- KT-&<, SHI*— KT*fc <9 , /U KT-i&< , SB 
if A;* h U — A#5n y $ ^tLTV'Tt^ihtf T'^V^ 

if a* h y -Jx&mm^-vx-h <o , a»o r a* y y -Ao->--!ry 

else ^li^V-r-f V^LTV^iV^ 
end if 

else if BXhV—J*frvy? £tlX\/^Xftlk>PXt£^ 
i f B* h 9~J>.^mWi^-h'X'h t ), t*o rB^hy-A©i/— y 

else ^li^yf-f^LTV^^^ 
e n d i f 

else h y— kfrxiy? ZfoX^ti^*/ 

if a* b y -A^fj^- K-cttJk<£-?&v\ a>o ta^. h y - 

else ^n.-^-yf^ y^i-tv^^ 
e n d i f 
end if 

else /*-Yy ; ?7a:-^$lJ^g|5^»)LTV^/ ( CV>*/ 
end if 

^frteZft&ty^yfjyyLX^fo^&a*, ^mm^2 3 5i^yf 

[00 7 1] if^^ 1- y-A^T^x-cy-Cfc^^, <b* 9, *&KH?TUfci*rH5-* h U -A if t>fe 
2. AiBro^ hy-Afc^^fctttftt^Sft-as fiJlTtt, **W#P»lcJ:o-c*»*n*JWH3-Kfc* 



-19-- 



Ufct>©.T?&9 s Wko^mz-T Z*rj-?htiiZ>* h [0 0 7 2] 

if A;* h ]) -A^D y# $tlTV>5 
hy-AfiA 

else if MJ— Atfin y 7 £;ft,-C^3 

h U — AfctB 

else /* if*^©* Ml— Afcn y £ ^sh/T^ft^*/ 

if A* h y — A*JigE8(i*— K, Ta^ h y -A©->-^y^#^ 

asftuh^ $>5v>»A^ h y — Mc-fo^&fc&tZi , rB*hy— 
tb*— kt, tb^ h y-^^^-^v^s-i-^ihcf", $>5^(iB^hy-A 

else if B^. h y-A^W^— TB^ h y-AcD~>->5r 

o Ta^ h y -A^S)*- K-C, TA^ h y-AcD^-^V^S-^^ittf . 
fc5^«A^ h y— Atr^^#fiEj j Lfttttu*, ft©* h y-AKtB 
else /* ¥hb<DX h y-At>ift^#£L&^*/ 
if p r i =0 /*Aii5, Bffi*/ 
ft©* h y -AttA 
else if pri=l /* A®, B^*/ 
ft©* h y — AteB 

else if pri=2or3 /*7 l )>'Kot*y*/ 
i f ft&©* V y-A^A 
ft©* h y-A(3B 
else 

ft©* h y-A!iA 
end if 
end if 
e n d i f 
end if 

So 

3.6 ^ftffl7^r^^h!)-A©7iyf^ ^#7* y t , A^iHM 2 3 5 

*©r^^-f^**^ hy-ASr*^tSt, ^Hffii U ^^*tfi-5fcfc^3^oiry-y-2 

SP2 3 5 tt»ft"t5*^-#>r Vy"l^*y- (ic_ip 2 4 © u-^*y" 2 2 9 

atic_ipb) ^(DT YV^m^X^^y^.y [ 0 0 7 4 ] 7**jBfft=«:7 , n-fc yf- 2 2 4 (C&l^Tffl 

3 5^(75 7*1^71 y^y^rrtl^ftl-Sll^lrii, U2 0 2ji»fe0il|HHC,fco"tSlf?S*l, ** MC*tLT 

[0 0 7 3] SXT<D&mmtc$ivZb%\z, yt^tyhtiii4S. ift^fi**bRAM 

y^y^rt^^n^SfcJC/iS. 2 0 3C:*yft£*l % E 1 ©P C I a** 2 0 6 Sr^LT=> 

1. /W7iy^y7rWCfc5 ^o-fey*2 2 4^eaSSJi5fc», *^tt"e*5«»t 

2. /l/7iy^77r t t I ©^Wo7^ /f / /J^b-r^tT-feS. fr*L<fi, a^nt yf- 2 2 4 
* hy-Ail^c* h y-A^e,eofco-c*)5 ^-©^fcio-cHtrPfli&^ttSrt^Stbv^ 
:/w7a:y^<y:7r©rt*©tf»tttt, ic_sta tilt. &*<Dmmz%±Wltt&'zimbtZtcit)iztt. -fa 
t u^***©7°U7*y^fc*y MCcfcoTSStU £ h©*fttt«rT#*«lt«»-*-Sri^a*L 
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[0 0 7 5] 01 Otts *fc«V:9*3 2 tfy h©8!7- 

K K) 2 8 1, 

-K2 8 2& < atfo 3o©*/<7yKA, B, C<D7K 
283-285 t>, ig*7K^2 8 6i:i:t)ICt 
Sic, &4£2 8 7 fc, *X hCPU202A5ffl 



[0 0 7 6] @11H ^(D^ity^- K281© 
#5t2 9 0Sr^LfcEl-t?S>5„ ^^T^- Kfi3 2 t* 
y h«-C> ±*^=-K2 9 1, M^-7°=>-K2 9 2, 
-f^77*F (I) tfyh 2 9 3. -W3tf§- (Pd) t* 
y h 2 9 4 , (R) If y Y 2 9 5 , n y {? 

(L) \fy Y2 9 6, 2 9 7 ^7-K2 

9 o ©^jven.©? -< -/u KoRwtenToafic^-f. 

[0 0 7 7] *^3-KRW 

[0 0 7 8] 

[*2A] 



[0 0 7 9] 



S2 : jJ-^a-K 






major opcode [3..0J 

K> 


0: Reserved 
1: ttfttt&£Raa«lft 
2: JPEG Effitttft 
3: ff JWHW 

4: (S»3>^-i73> 
5: 

6: r-^SHHt 
7: A-7h-> 
8: BMRii*tt# 
9: /t'J 3 tr- 
ii: ^mnmm 

12: 

13: £t£ 
14: Reserved 
15: Reserved 


minor opcode [7. .0} 


A^0)f$lffl. d©7-f-;H«CDf?%<tttmaior opcode HJ: 


I 


1= S7B9KWii*4ffik 

0= s5Tecsda*if?jhaL 


[«2B] 






Pd 


1= "partial decode" BM*£ffiJHT£ 
0= "partial decode" «IS£ffiffll/£vi 


R 


1 = Pixel Orga Inzer's kftgtl>*J7,Z (po Jen)JC J: 0 *H£ 


L 


1= ZOfa^A h U-A(A or K)\*XV>$t^(»1t)t>\Z 
"locked";* tl* 

0= CCiSfr^X h U-A(A or B)tt*C0^^Ofc»lC 
"locked" 3 tt&H 


length I15..01 

iy> 9^ -"^ K 2 9 3 


'%"by H 5^ M fcffUna. -Wfrt: 1 y h 2 9 4 



[0 0 8 0] I 

#5. 4*, CK^vy-yT'Mi ttitWfcTJyfi-y? 



y h 2 9 4<£>tf y hri*-iry h£ft s i c_c fgl/-^ 
9 *X-W®*- Kit* 4 1 . fiATfciB^a ± 5 I^^cd 
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2 9 6^ fflmchtc*) l^htD^^^t-T^Wk 
-AH7xyf$h5 0 f $7-f — /V K 2 9 7 l'± i S^h 

& 3: ^^7>F«^-fy 



64>ro P o_l e n ^v^^PjATjfi^So SttW 

10 0 8 1] Ell OKfc^T, *>3*frk:&S&:dv<? 
VK28 3~2 8 6©*r4ffl^5l^*^:/K:JfcCTHr 

[0 0 8 2] 
l«3] 











input 


pixels 


3 




input 


pixels 


2 


JPEG 


input 


bytee 


2 




input 


bytee 


2 




output 


bytes 


2 




input 


pixels 


2 




input 


pixels.bytea 


2 




input 


pixek,bytes 


1 


■MM* 


input 


pxxel8,bytes 






fixed 


fixed 


2 




fixed 


fixed 


4 



[0083] 11214, 3 ^7 is F-foittCtt'tZM 1 
0rof-?!7- h\ t^7VKEIf 2 8 2<n*f—?y 
- Y? v 5/ h 3 0 0 £: , 2 ah*? >- K4r*lc*ft- 5 
7- K7t-v- 7 h 3 0 1 i ?;/TttV^„ 



10 0 8 4] 
[3M] 
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what 


^*-st^a-K- 12-13: : ^K^>KCA<e 9h7*7*SJpC 
**d^^**^«c Occ.dmr Ui>**tt cc=l. nonna!lze=0 fcLTSttlC-lr? 

jL*y -J l*t- lo.TH 


L 




if 


Pltt^a — 

3=m 


S 




C 




P 




bo[2:0] 




R 





[0 0 8 5] ±S£cD^[£*5^'T:, — i£7- K^t 
— K©#&lC|i, 3^ot' / /f2 2 4 *S 1 o©rtSB-7 f — 

(DWr&iziZ, =7'Dtyt2 2 4^<-o^f-^4: 
f-^/i-LT r^^/p^j Sr#S. WE** 
©Llfy htfifn 7—9ft&<. f*— 

[0 0 86] El 1 0 IC&V^T, -Srtt^cD^y V K/ 
JBl*!7— K2 8 3-2 8 614, K&JtOftfcS 

WS2 3 5H:, *^*-Jll!»-C«-£1-*. **JK. 

(Eli i) as»j*-e*)4»&jcri3i9-Sr4i«-*-a. ft 

fc, CBu s 2 3 1 &;frLT«*<DI^**&!8)£-t5 
r t K <t •? , ife^-Mftft! 2 3 5 ^♦■frSrUfi' U ifrfrfc 



[0 0 8 7] /K^U-^^^[4ftf^[^C.TV , > 
T\ Ui?x9*1£tt : ei?=>-*- /MCfcoT rfg^&^/g 

BCO^-rT- (£tft, c on f i g 1) ft, Atf» 

9 \t-mcT K £<D± %t£94 -/mfc® 

947 <$k%, c o n f i g 2) fi1-^-CO*^a^/U 
"f T'tt, US?* * © try h d*<b K Wy?it&W£ 
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[0 0 8 8] %m947<n>\si??\9 t LTI±ffi*©<b© 
*5#S-T5o M—094 7 control^ 
i7*9) itrt^X<D^'J=.—/U (US?* 

-5„ Controll Ui/7,9\Z, 7 KU-^Mfi&f©:*: 

\s*?*9<D%Z.<!)9<<7 02tPk c o n t r o 1 2) 

«, t*y hrttc^^nSo 

[00 8 9] M&L<Di'isx9 947 W^f 7 -7 Vis's 
-fey hSftfctTy hlc r i j ^tftd>bff«i& 
y h£l^**l*)K^tP, i©.fc 0t£947OUiS7.9 

\ttK^h<r>^is =.—)Vfrh<r><< > 9 77 Y /*-7—\% 

[0 0 9 0] 3/at-7t2 2 4«# ; e^a-/H4, ^ 
^■SrSff«fT?li , 5^-ttflB©i:#KH:, CBus23 1± 
©c_activey-Y ^£-fey bfSo £©fc#> N $J 
-frfaffltiH 235 i±, CBus231 ±©#^=.— 
h<Dc_a c t i v ey-l'V© TORJ ifa^d* 
HTUfc^Srffifif Sit a-*^*y 
^2 3 6 bffim4 >-97x.—Xfflffl*i>x. 

ckground7-Y^5rix.5, ^<7y^ 
ft, n — jiflsfeyjisfryx.— ^irJ^ia'T^^^i— 

[00 9 1] *-/<777°n- */WDMA^CD||fff- 
^/WS, ffi©ift^©3gtTif-f ^itt&fcS. 3h- /< 

2 3 5fismc*-;<7y7°'fo&&mft£flX^?>frk'o 

Wtt-s<7 y T^iS^HOft*- KfcfcoTV\h/tf, 

ct-z^y/^^tu mmjy97*-xMm 

£B2 3 7^»n-*/^^j;M»|S2 3 6Sr»J«L*^S- 

mn-tz, isi>x9*mi&Lm7UtL>. ft&vm&x-fo 

❖#»Ti-*©«r#fcf fc*<jHW»«2 3 5f4^^^ 
»T^?, ®M^V*9. **1M>*W £H 

[0092] t . ^wMfcttstfe® 



tTlcS-r5^IB]<D^*^r)^v\ 4HMNMP2 3 5 
!*> !^T©^#^iiofc^-e^5r7 B U7xy^1- 
5. 

1. m^ft*<D^tf4v9 77\-*?fo!k*Xt£^ 

2. ^ffiUfft^^s^^v^^-C^v^ 

3. &©i*iH** Y]}—^7V7^i7^mX'hZ 

t5wi*4<, SM^T^CBus23lJ:© 

c act i veto background7-l'y5: 

m^ZZ £©^£tT5„ 

3. 8 ^©JffllgPWU^^^CDSif 

4i$i>WTi-z t % #^sg#pins 235 osr lv^«*s 

a»6©T^-fe^i:©|B|}HolBJBSrWt5fc»KHajcff 

£*r$*vc^5»£-K:ri % tv7*7«$ti5i-e 

2. a«l*i^**©JE*r 0 ♦^WHWes^W'** 
■eftVMi^fctt, 4Mt1M>9 (ic_ipaiic_ 
ipb) £^©lMX#tf2it]$-£5 0 i^X:/^© 
kZti, V\>7°9c?>m>'fo<frX'<y9\cxi--b'£tl 

5. tot, y>^#4-«qMb«e- FX-btv&mr 

Ui?7s9 ( i c_f nati c_f n b) ftWHOrr&Z. 
[0 0 9 3] ( i c_s t a t) fc*FU* 

±C Jy9yy*b\ctt-f5ftik1>mW)ftM\ct£^tc 
<0. i^-^Cfci^Cll. ^«»9J2 3 6««cih 

y h (a_p ausei b_p a u s e) S:iry h-f"5 

i*ie>©tfy hSrOfcy-fes' b L*»tJttf*&*v\ 

3. l^y^t-f^PW, CBus23 11l:c_ 
e ndflr^SrjSffiU 3 7"Pt7t2 2 4ifOi©*y 

h©*Wtt, ^T©^ift©tt(cjittS$tL5 0 
a. r>>-^y^#^Tj -f^^hjjJtcfet 
#. 1"*fct>, i^TU'^^ ( i c_f n a t i c_f 
nb) is— try^giflr-tiU y 9 77 Y*s—' 7-y^##t 
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5, 

1. ^Wy^l/-^? (i c_i p a t i c^i p 
b) 

2. ToDol/-^? (ic_tdaiic_tdb) 

3. iU^^ (ic_fnatic_fnb) 

4. -Y y^^/h US?*^ (ic_intaiic_i 
n t b) 

5. #j*U^^<t»<0*Jhtry h ( i c_c f g) 

* T * ir 7 * T *s^«CT?S# $ tvT V n 5 ffift 
fc, #^ffi#PA2 3 5 Ht 4 &mZM&1-Z>1t!t)\C 1 * b 

[0094] ituifiwj; 5 s^-^-^aai^aaft*- 

JOLT^t, OxFFFFFFFF^OxOOOOO 
0 0 0iC7 5/lfy^$tt5 o WfeOttiiW^ 
h l/i?^.^ (ic_intatic_intb) 

1. >-^7 7"b~>— Wy^77"M/^ 

H) x ^WWfctttt*^***© r 

i&tTj ?<§tfy Mic_s tat c f(Oa_p rime 
dt b_p r ime dfy H fc-fes/ Ft^:^ r«> 

2. 4i/9 77Y*s~* s s*tt&WT*s-!r^xtt 
<fc9t> r/h$<j , hy-Afc^T*— 

y ^^'>-^V^f » (i c_l oa 
i c_ 1 6 b lsiSxfp<D&) tn~ X*htl 



tf, ^t^mmnti c_s t a t l/^t©a_o 1 
p r ime d foSVMib o 1 primedfcfyr- 

4. 4 y^77*h-y-^y^#fis|l7-y-^^ft 

J: 0 t> r/jv$ < j , h y — AtefefcvtJl— /<5 y 

[0 0 9 5] fl-gB©^— Mi, tftlU'v 5 ;*^© 
-f^^/FfftVh (a_pr imed, a_ol 

primed, b primed, b ol pri 

medfy H h-tZZ. 

3 . io ri^ffiiivffi 

01 3fi, ^ftj»«fl52 3 5 UfcH-Cfc 

T'n-fcylr^ 2 4<D^©Hffffl^iSr < i e ai-SII ; rr^ 
SB 3 0 5 Sr-^tr. 3*fT*l*l«B3 0 513, ^£iR|ft{ff 2 3 

^«*»3 0 6 4»^«*»3 0 6tt, 7° 

*7f/^7r3 0 7HM«rSftL, i!fri£<z>iI9« 
^I^S3 0 6lt i,<D37*ntyt ; ev' 3 

5„ 7 e U'7is/^/<s'7rS'J»3 0 7{4, 7*l/7iy 

f 1 ^ y 7 r ffl^tf^T' u- 7 1 ^ f-^ y 7 r ri> cogg^ 
5 1 t> ic, 1*HHS^-H 3 0 6 

tA^y?7i-^^ s/^2 5 2 (H2) icorato 

^O-^i-^Srtfaf-S,, *7c, -f\yy ^y=f-/<y 
■7tMW$K3 0 7ttZ10(0^7K^V^I/v ? ^^ ( i c 
_i p a b i c_i p b) OSgf^tta+So 
«gP2 3 5. i*©^a-/l'2 3 9 (02) , 
V9 7zz—*UMU2ZS (0 2) ^P>WCBus23 
1 (0 2) ^<DT7-fe^H, HoO^^a- /PCDT^-fe 
^S*ffi-C©li^^5 fCBu sj sSf?§P3 0 8(^*3 
l^-CfrfrftS, g*«CBu s 2 3 1 Kiora*©* 

[0096] luii, 013 (omnmmn 3 0 5 % £. 

1. &cD^£rfcDifr-£-* h y-A*»fe©9W1-*>** 
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2. Sttlfr^-©? my U 

6. 4rftPttTlfc&i'3?*4 r £3Ettt-5. 
[0 0 9 7] 3Hffl)*p»tt:, £fls©A*llfT*-f ^/W* 

ffai-5*#*3T*t««3 i o (sat, 

*) &«x.S. 81514, ±3e©A^HtT^^^«rff 
S-t 3«MB$B3 1 0*cffiS^IiI5r*bfclIT*foS o 01 
4 tC&^T, , £ftmW& Ht*^ l^7iy ^SkaSB 3 1 

#ia£*7 5o 01 5«J*£Bt;:*5^"CllW&3 1 2fc6t* 
A¥-f2> e 01 4Cl/^^SlS|i3 1 7(4, WSOfo 

S&a&fr 5. »T^# (i c_f nati 

c_fnb) (ic_inta 
t i c_i n t b) fefeltttU ri^-4ry^#M» 
TJ 77 b ^5 5 

<b, U^^f=3lgl53 1 7^tf 5„ 5Ct' . Ui/^^ta 

SP3 1 Tli-fy^^T' nwMast>f?5. tw^yy 

^MM8 3 1 814, i c_s tat W^^^'t'O®^^ 
«tr y KDtaSriiUT, /<5y^**«X»T*aa 
©ta£*r5o *fi*l#P»W\ 3 1 0 £0 1 

3©ifo^«^S£3 0 6 icofHcD^^^-^&fT^ 

[0 0 9 8] 01 614, #^-^SP3 0 6SrJ;9f¥»BtC 
-^Vf-3 2 1 Sr#x.?>o ^^-^yf-3 2 1(4, # 

■>f^3i2 tafti-^o *fc, ^«-§">-^-^f-3 

2 111, *^©*T8M!fe^l»'fr<0*-/'<?y^S*if© 



^m y F*s^iB$*j,TV^5*^3— K 

[0099] 017 14, 01 6 0^f-f^^5'ff> 
—}r>f-3 2 1 **9*lBfc:*UfcBI"e*3. 
^/•?5/f-^->-^rVf-3 2 114, ^ftW^-'Jr^fMfP 
Wi-v^a 2 4 i:il«!Lfc^v J a -/w#«^>-^> 
■frttiKW (W*.tf3 2 5^3 2 6) £{t;l5„ 

(0S;tf*3 2 5) J4, CBusf-f^-^f 
C B u s Srf ijffl Ltl* ©1/-^^ &-fc y h 

^f4">-^>f-3 2 ii>mn<Dlt#>\z.='zf v \s 

5-^^^P7°n^9 S y^^afc^^-^^f-3 2 1 
Ciotty hTy^ftfc^Sr^-fo 
[0100] HBKttx *^S^">-^>^-3 2 1 I4# 

5?a-A4»J»Wf*, f ^-fe^-^^if 2 4 6 (P 
O) , *?—9*fy*S3.WW&2AQ (DCC) , *<< 
7VKt-^fB24 7 (OOB) , ^7^K* 
-#-?M1fC24 8 (OOC) , ±f-^2 4 2 
(MDP) , tt**— ^>f1f24 9 (RO) , JPE 
Gxya-y2 4 1 (JC) *if©«)«**l,«*5?a— 

»23 8.(EIC) , B-*/uy*!J«»«2 3 6 (L 
MC) , ^ffl#pai2 3 5 g# (I C) , KM 
i-^yf2 5 2 (IIS), 3|t#*5>»— A' (M 

m) ftif©*s?»-/wi, ^«-^aa«pjci4«^$H 

5itl4^v\ 

[oioi] ^=.-A'££>_BfJM# 

[0 10 2] 
[f?5] 
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& 5: Module Setup Order 



'f Vi K ^ # =i 2/ ~7 

1 -"^ •'^ I ^ s Z2 ~r s s vv 




\ D 




PO, DCC, 00B, OOC, MDP RO 


1 


CSC 


PO r DCC, OOB, OOC, MDP, RO 


2 


JPEGft^Ht 


PO 'dcc, OOB OOC JC RO 


3 




PO, DCC, OOB, OOC, JC, RO 


3 




PO, DCC. OOB. OOC MDP RO 


2 




PO DCC OOB OOC MDP RO 


2 




PO DCC OOB MDP RO 


4 




PO JC RO 


g 


DMA 


PIC 




■ — - 1 

ro /m r-^ 'J x | ittji»*j 


PO DCC, OOB, OOC MDP RO 


G 




PO r DCC, OOB, OOC, MDP, RO 


4 




RO, RO, RO, RO 


7 









[0 10 3] (HI Utto^X, /vmj&i'-'r 
(fi!xli3 2 5) ttj^*l'S?**T*1r*«Hi* 

fCs i»Oi"-*-^HP«iT J'J/ 3 2 4 (i 4 fitr^ro 

5. 0i8fi, J:©«k:«o-cia»i-5*^a-/HJMi 

3 3 0t?SLfcE!T?fe5. #*5?a-vMftfc 5/— 

[0 10 41 01 9tt, 01 3 C07° !) 7i -y^s* V~7 7 

^y7r««WJ#- Oa/ntyt^ (6X32 

3 3 5 Sr4|*.<5. ^5 7xyf^7rttIB 

u s {ryy-3 3 6 (C ioTftMBSftS — 0©#*ii 
^#-hi, *fiNW»(B5x #frW*»CB 

u s-f *fc?**-*rfc2SttH-fi— ool&ii* 

#-h£it;t3o I Bu < 3 6 tt, T*!)? 
syf/<y77 3 3 5C0A7jW^^7a:— 

»3 3 7fct>#*.*. TK^ffSttS 3 7W\ i c_ 
i p a *>5V>tt i c_i p bco — r>£§iRl-A7M 

c_i pafe^^fii c_i p b (D-oSrtlAD;*^^ 
tit, i c_ i p a t i c_ i p b l/^^'ti'f V7* 



3 3 9 (±7° ^ y y y r M'®U 3 0 7 ©£fls©fW 

3.11 ^-i^a— /1/D- */H^i^^^7r^/KO|ft0^ 
01 3ic*LfcJ:5fc, *^rt!»P*5?a— /^S#Sr^tf 
4Hes?*— /Hi, 02 0IC^L-C*)5CBu s-CV^7 
3 0 3 1 1 h Ltc 5 3 0 4 <D 

Y*m*.. CBu sS*^g»j-M(t5i:i:1>{c 

/KDi|!IJ®f4, CB u s-Y y?7i- .X 3 0 2 Sr 
jfi>T l e t Sa.—;l'*P<D l/^? 3 0 4 tC#^j^tf - t Id 
io-CtTiJtvS. CBus!!6£B3 0 8 (013) li, 

#frffiw«2 3 5, ^ww yfiy^-xi&Wto. 

—)\>v> zl — /Vit c B u s §r$!l^ U CBu 

[0 10 5] 02 0f3\ £^v^-A4::}o^-tfflV^ft 
5CBu 3 0 3©^Spfl|/&&^Lfc0 

T*foS„ ^$CBus-<y?7i-^3 0 3liCBus 
3 0 2^fc(7)^WbS* J ^»#^S*SrS{f-t-5 t 

3 4 l&^LTX3W3ft5V'S>**:77'f<rt'3 04«rf 

A'<D/*vmiL<D3im%n?w&y'(y3 4 4»o 

TV^„ I?CBus-(y?7x-^3 0 3liCBus 
[0 10 6] T c _r e s e t J ft-^-3 4 5(i^ipCB 
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us-f>?7 * 10 3 ftcD-f CD ^ ^f7 

*A'MK»K:*y H-5„ L*»U r c _resetj 

ffl-t Z>VtB-v»lt})±v h Lftl\, ^<£>fci£>, r c _ 
r e s e tj #CBu s fcLSf t^ffl^ftfci LTfc, 
S«E*631ttflfr*»L60»T?i|*T-t-*rilC**. r c 



c_err « J) 



[0 10 8] 



[0 10 9] 



intj 3 4 7, r c _expj 34 8, r c _er 
rj 3 4 9f f fi, &Tn*XZ&3\t*X t t*Sa.—A'« r 
r_i n t be r r_i n t_e n Ui?xf (Dfa&X 9 

[0 10 7] 
[gitl] 

error[i] AND err_mask[i) 

lfc2] 



c int= > intemipt(i]ANDint_m8sk[i] 



internal i|not reseivcd 



[|Sc3] 



c_ exp ' 



exceptionfi] AND cxp„ mask[i] 



[i]no«i 

[0 1 10] {f-S§- r c _sdata_inj b l" c _s 
v a 1 i d_ i nj 3 4 514, ^-^a.— /^Jco^-C©|fl 

_sdata outj £: l"c svalid ou 

tj 3 5 0(4, /WjiJcO^TCO^CQ^-^a.— 

*3 0 3coatE<t tTf4£ircOt)CD^*n$o 
1. Uv^WS^ffiL/^ii^tS 

3. v^-wm^&^/mt^mm 

4. ^^j^.-^nm.m/'s.mmm 

3. 12 w^^^^aiL/st^ta 

^*PCBu s-fy^7i-^30 3[4CBu s ±l£2ffttl 

£gf?fl1t5. I?CB u s^^i-T^fSI-S 
CBu s^t UT^T<0 2aig$>5o 
1. ^^A 

-X3 0 3rtcOU^^[dl, 2, 3, 4^-Y hgg^tt) 

L/##&*i-5iW£-f3. «t ^ 

£C5„ **5, ^^^.^St^/Bi^tBLcD^-f 

-1- —/u Vit^fi^n rioooj t rioou ~?h 

5. ISJpCB u s-f i'*?*— *3 0 3 L 

^WLijf^-Ctt, I$CBus^V!?7x-^3 0 3 
f4, CBus&Scd r re gj 7^ — /U KfcMVvr if© 
\si?Xfimij\C r c _s d a t aj 3 5 0 SrJHSM" 

5*»*atRi-s. •t^a^-efi, l?CBus-fy 

^7i-73 0 3f4 r r e gj 7j—/VYb Tb y t 



V^^i-^ltT'-^^MI-tP^IC r c _sva 1 i 
dj 3 5 0 *mm-$-Z> 0 #$ ii*»fP#»T1-Si:, 4K 
JpCBu s-<y?7i-73 03 f4 r c _sva l i 
dj 3 5 0£j£fflLT&g1-5. 

2. ^-fl/C 

^-l':7 e Cf4, 1 OCOU^^^cpcO/^ hcDlotCtffcco^ 
1-5. ^tx-^tiiiocoy-Kt^ifte,^?,,, 

[0111] HipC B u s 1/97*-* 3 0 3 

V^^&fS^S. TregJ Tbytej Ten 

a b 1 ej Z^-Vl-KSHg^-LT, &g*^*-7VWi 

1-5. £*UC.fc!), &54tfy Mil-^TcD^-TVw 

3. 13 ^*y«i«R*wu/»ta*ffa 

IfCBu s^y^7s-^30 3I4CB u s±C0^* 
7^-^.3 0 3liS*^->' a -; W7 Kl^fcJgLT 

ffi#1"5r. £T\ ^ipCB u s sOfyjc—xitmWi 
Th'uxb. y*y«*ffiL/*#&*&?r5lf-7>^ 
i-^©7 KM^ho — :/{f 3 4 4 t §r£$1- 
5. ##&^IWC{4, i«CBu S Yy?7x^ 
14, fis^frbW Y4*—-?frfe%*V-7=Zi?x.-;v 

i'te^i-5. 

[0 112] ^*CBu 8^^71-^30 3G>tM£ 
14, CBus 30 2l©CBus^©^/f7"7-f-;V 
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yy^i^Z 0 4 ttiitl^^^^ 3 5 3lrjl§]^^— ^ 
<fc 5 ftf^W L/##&*-fffl#flSi5 3 5 
2\£Si<oXW®£ix5< > CB u sift^l^^Sfc^-UJ 
Lijf^-Cfc^ ^Wb/##^WIH5 3 5 2\$Ui 
M^^3 5 3H^-^H:U r c _sdata 

MPPSI53 5 2[JU^^7r-f^3 0 4Sr^^>— ^HC 

#&^ftlJ#P81i3 5 2ii&W£iB%-3 4 4 &£$U 

^3 0 41*, L^**jliR«f-g]5 3 5 5, ttiTjirU-? 
?3 5 3, ^V^77*H356. 317— 357, Mfl-3 
5 8£j&SB, T>"?X?^7—*EJ$.&3 5 9, 
a— U"^**S5 3 6 0<D4OCD 
SWfe*>b*J*$*vS. I'S?* 3 5 5te, SK 

*wu/»trii*«i*85 3 5 2a»e»©m-^ r r e f_ e 

nj (1/^^7 7^^^-/^) TwriteJ 
Tregj &«^U fc5u^^tr-r^-7*/Hr1-S 

?35 3 ft, R*aL/»*&*M#P»3 5 2/S>e>(Z>{f 
^ r r e gj WfiKJ&CXs WS?**R*fflU!»I©fc 
*tCliELV>Wv > ^^T ; — ^?rjl^Lc_s d a t e_o 

u 1 7-r^ictb7j-t-5o 

[0 113] 090.£j*a$3 5 6~3 5 9ISAA>t*fcK9 
-siHfcfflSftfc&tttfj^-flHS- 3 4 7-3 

4 9, 8 6 2)-*4riWS. <§-W7J^7-*fh^-f5¥ 
jfettKUeoiifJTfeSo ^^§3 6 Oft, S5fC*J 

^xisVxt^yv <d®& &wm vfc t § ten cfc «t 5 

3.14 C B u s flf $ 



«f3i©3i«3, CBus WUr**) tt* #*^a-yl/© 
£P£CBu s-* y^7i- ^tf©^v ? ^^Sr-fes' H"t"3 

bWbfrteZSK.. CBu sttlikTO-onS&iZGi- 

5. 

1. /u«rWlbT.51W*J' < * 

2. RAM, FIFO, #*S^a — /^©tfl&^Wfc 

CBus tttft-ft- T K^-f-^7*o ha/i'Sr/BV 

C B u s tt:R«tt**tt©flm**», r-* 
Sr®*-t"5 - t Id «fc 9 OT-7>-;^e>RAM^F 
I FO7- ^{CT^-fe^-f-So 

[0 114] CBu sit&r<D3-0<DifhhMCii*)ll!l 
1 . <IW*3SfT«*«MfcfrftWP«2 3 5 (H2) 

2. ^-y-yh *- K'<*»f£0S?Ti$©* 

u-<y?-7*-* i $mn2 3s (H2) 

3. ^CBu s-fy?7i^«$JlfeS:ll^ 
Vvfft<D^-Cfc, aHS!)*^^— A-fiCBu s©^*^ 

[0115] mr©*ft, »a*njfe0ij{c*3v>Tffl^s 

©KigL-O^CBu sff-^cD-oro^aSr^Lfct© 

CBus flre-jeit 

[0 116] 
[3?6] 
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M6: CBus(3*H£8 



Name 


Type . 


IV-ftnitinn it 


c iad|31:0] 


source 


tnstTucUon-addre&s-dala J 


c valid 


source 


CBus instruction valid 


c sdatar31:01 


destination 


status/read data 


c svatfd 


destination 


status/read data valid 


c rcset[15:0] 


source 


reset lines to each module 




destination 


active lines from each 
module 


1 /* tinf^ltriymtindJI S*Ol 


destination 


background active lines 
from each module 




destination 


interrupt lines from each 
module 


c_errt>r[15:0] 


destination 


error lines from each 
module 


c repine 


CIV^, C AlCJ Hal 


has control reaucst 

bus control request 


c entl.c ent2 


IC 


bus control grant i 


c end 


IC 


end of instruction jj 


elk 


global 


clock I 



[0 117] CBu s<Dc_i a d{f %r\5.T KU*?*— 

1. c_i a ditCBu s-ffr^T FUX&mWlZtl 
HMtt<f9A' (c_v a 1 i dM) 

2. c_i a d (*t'&*»«0 ^c_s data (fig 

( c _ v a 1 i d (£) 
<fe0)9*-*fv hfJ*-— A^c_s da t a ft ■f-trfgK) 

-*-<&. 

[0 1 1 81 02 lJCfeVvC, /<Xte3 2 t*-y 
-T HV*— 9*— 974 — ^KSr^tf. i©7<f-A'K 
liWT<03O©^^y (370-37 2) 
1. fj-fAWllH? (3 7 0) fcJu 37°n-fes'f-t I ©V'v ? 

*-KPC I ■*'(9A't:9ffiLX\i*Zj>m'(y97*- 
*fBHM|S2 3 8, #^©7t*tC3 7'l3-feS'f-tr«^tX 



[0 1 19] Ztlt><DWtftX*ii, juit-y-J 9 A><DJg&<D 
9 a y 9 9 >vt%>?—9 *M 9 A?b 4 5o 

2. ^^^Blf (3 7 1) tt»8f*- K-effl^e>iv 

KPC I 1M 9 fr&MfiLX^ZftUJ >97 *— 

t>&<, f-W^/Kic.sva 1 i dft^"SrfflV> 

3. ^'TT'Cill'ft (3 7 2) lt**?3.—j]s<ni/i?X9'¥ 

mt, ^<Dtzft\c^y°uKy-f%mi$.^x\^'fo^m 

m&2 3 l J $>tmCBus'(*97=-X\Z£vX&f& 
£*v5 0 947°CW}ftXi$T-9i-4 9M*ft<. x- 
9f&ti>&lM9/l'Wc-£-&tlZ> 0 
[0 12 0] £4trtt<Dj"( 7*7^-^r-U £1T©SK1 

cBusis^^y 

[0 12 1] 
[«7] 
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a 7: CBus b7>^F^^3>^^'/ 



c iadivoe (DflS 

****** *y 


» ~r if t± *r ^ | ^ 


~ y n n x 


oooo 

VJLrLfXJ 


no-mi 




000] 


reserved 




0010 


peripheral interface write(/5] 21 "f 3^^' — 7 


B 1 








0011 


Derfoheral Interface rcaili'Hffl'Y V — *7 


B 








0100 




B 


0101 


peneric htis read/iSS^)/t>C^fH7 ri > 


B 


0110 


Iftffll n-Lf»rr%r>rv wrirp/ Hl s — Jly >C ^ U -ttiA 


B 








0111 


local meraorv readf D — 3r ' J §?Es 


B 




^Jli „ - - 




1000 


mooter write* U £ Art 


A 


1001 


recrtotfir t^sAi \s i/jL % SKlt 0 ^ 


A 


1010 


mrwHiilr mFnt^iru wrftft/'-F*'^ hp 11/ 5^ *F" ' J 

IJIUULllU UlClXlUlj/ WliiC\ »- ✓ — *» 7r A* *— > 


A 








1011 


mfviiilp mpmnrv refld^y^i*"/l'5(f J 


A 








1100 


test mode write/ K ^ — KHJi^^M 


A 

4 


1101 


test mode readfjT** ht- FSHfc 0 ) 


A 


1110 


bitset (tfy Msy h) 


c 


1111 


reserved 









[0122]/ 

7 *f — KTfc 5 „ w x ^ 7 -c — /V K (4* 5^ a — /nfi 
<D i£<D 9 *3EIW* 5 7 «f —A K"CS) 

^■f57-f-/UKT?*)5, RAM, FIFOfrifCDTK 

ty h&&4t<frtf%^tbtiitbz icm&.ztiit'H 

[0 12 3] ftJJBwat), CBustt^a-/V:*t 
(C, *S?a-A'*s**3*|(f , f«)itK:a6fflS^*c_a 

c t i ve^ySr&t*. *^*IW«i!ttroflr*K*<J 

V>"C#*<0*TKpS:ftl*i4:«*-C#*. *fc» CBus 
a — A' wtt/<y^^7!>^ K-=E— WKItP 
"f"5 c backgroundy-^^Sr, Uty h, 31 

3. 15 3^n-feyf-7 ? -i5'^^7°tx-^^ 
Bl2(C4oV^, ^rfn^y^-%2 2 4©||^ J P 

^ : e•r/^^Srfflv^5 <) ^Si57 ? — #7*— *ry hfi, n— * 
W^^y^i-^PC I /<*t£Z(D ^7°u ±y 



•?hZ>, H2 2Ht, m*<DAjj/mt)7*— 7? hSrm 
*WK:^Lfctat , *5. W)*«7*-?}> b 3 8 1 

?}7*— ?y hT-fcSo itv??©^— 

HSr, J PEG^^t:|g2 4 l^±x- 
?/^SP2 4 2^AA**l5A*rt97*-vy h38 
2iZW7*-~?y h-fZ> 0 r^f 5 2o<D^f6g|5« 

— ?y h3 0 4jC**-t*S. 

[0124] mmmva, 9Wf-*7*— ^yv\%z 

K4o*-co*f-^r*A'Sr*U #^-r*A*l, 2, 

4, 8, *>5l^ttl 6 tfy YV>'7°/Vi)>b&L'Q3LiX^ 

5, ^z-m, #^^1^ h 3 8 

t(C2 tTy S(Df-ir^/W0, < 9 1 Y^/l'2<^ 
-^BEff^3 8 8T?fc5. 02 4<D]*C<D0}3 9 O-Ctt, 
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G?-**"?*?^ b&3 2\fy M7- K**U ^ 
^/uz't\C8 b*y Y^-f^A^-\^^irzf^^^ h39 
5^$HTV>5. 02 5CDf?ZCD01j3 9 5-?(4, Ify 

^ri-Sf-^^^^Ki^ h 3 9 6#^£ftT^.5„ t> 

[0 12 51 

l'U b<Dfrtf%%}XhZ>£ote3 2tfy M7— Y<D*s 

—fyxx-hZo rcoy*— h^^^jgi 2 6 ©3 9 

9 t LT^i^-CiSi?, &!7- K^O*-'^ h 4 0 0 

rft&j 7*- Ft ^x#mtstiz*7V*t bx-$t 

-$(£, Z-frbto?- 9 H4fe£ 

(0 12 61 =J7'n-fe->f-{i4 0»rt§|5T f -i?^^7*Sr 

T?$>9, ft^(©3 2 t*y M7-KSrl^^T4T^7 1 w^ 

hc03 2 If y hT7— K^W57d— 7*/ YXh 
•5. 0 2 7.C, !7-KiS4/^htfc5A , 5 /?/^h7 
bcD#|4 0 2^1% 02 8(^^1-0:00^-^ 
9<izf\$. rt^-fe/UJ 7 T y h "Cfc t) , 47?f^ 
-f'M hf--V^/WCD3 2 If y h!7— K4 0 3fr<bf&Z>7 
t-vyFtfeS, iWtf^-fe/V'?*— < ry M44OC0 

[o i 2 71 0 2 9 K^-f fccort^-* 9-i 7\t rr 

i//<y*/<-f hj 7*— Ty hT-$>9, #!7-KH:-0 
IS, 7^?^7*/W Mf-^W4*/]^ 

ffi0>rt»f-* M4 rflbj f-?7t-v y 

hiLTE#$*T,So JMB?*— b<OA7rr-y-(4 
S§J*I^SI57*— -?y hicm&ZtlZ. 03 0(4, ffi* 

w^-- -fi^-f\ciioxmn^i>n^y*— ?y m 

1 OfrbAtsy*— ^y b4l 1-^co^gW^lg^L.t 
03 1(4, »**-*'J-Wf , 2 4 9^«toTHtT 
?y h 4 1 2i^^7t-V5' M 
1 3^t0^^ffi^L-CV>5. 
[0 12 81 EAT, «*Sr*fi i -t-*»3l*J:«J»iWK:tt 

bji-5o *-f a2t7*-**wb7;*— 7? Ka»e>rt»7* 

(0 if fc 4. o Tffl V * t> h % ^mzfp LTV * 

5o nc»tt^*Btt7*— M i 6-t'fc3#, -ft 

97^y^^ by*— ^y b4 1 7(4, Ti"<y9 
!EMit4 1 8trffoTrtttTW<y^^ hiPfliftS 
y*—Ty MI 9*±*i-S. 7^'y^IESft4 1 



J-rfZ** hSrBiDKKJMl&fr 5. 033(iTW< 
y?E${l;&a£;^Lfct<9-Cfc£ris 4/M hf--^^ 
/V-Sr^i-5A73©5*>lo©^ h^-r^/KO^a^j 

[0 1 2 93 11 3 2 (CjbM^T , ^ y ? Egl-ft; 421 

(4, y^7.M)-A4 2 2 <$<nmm*y' v 5 ^ * h 
MJ-^4 2 aicIHM-&tta*fT9. ^-r 

(48 y Ktt(CttM$nS„ «*tf, 4 tf y bJUffi*^ 
-< h*ffi(dSg^i-5^g-tfi, 4 Ify HOxNft/^ 
UtO xNN(C^j?tu5. 1^ b&>±<D*7i?*t 

hro*a-(c(4§]t)itTd5=fffc.ti.5. ^hy-A4 2 2t 
f-jK-h$n^A73^^^^ h^Xli, 1, 2. 
4, 8, 1 6 tf y HJ-^Xt?fc5. r*Lbtt» * 

[0130J H 3 4 (4, f-^r^Z't (C (0 2 3(7)^— 
9y*—*yb3 8 6Zb<D}:oi l z) 2tfy htt5 3 
Y^<DKtiy : ~-9 A 2 2a*A73$ 
ttfcfct <Ds<y91E8Mk4 2 lco^Sr^L-CV^S. Hi 
TJf 1 — hft^7t-Ty H4 2 3(C/,eoT 

[0 1 3 1 1 03 2(C*5VT, bT^-fe/U^ h y— Ali-t 
CO^. /V^14 2 5, 7y^y^M4 2 6 1 gPH 
S^«ta4 2 7CO^-fn^(cS?3tv5„ 0 3 5tts< y9 

4 V\c;<y9Zfrtz'^ b* HJ-A*s^«5*b5«T- 
£^LT^5„ BP*>, co^r^-r h^hy-A43 
0d^!7-KrtcO4T^x^^W h£MS-f57*— ^ 
y h4 3 1 (dJEiii$4x5o 7l"<yyim4 2 6(4li(^ 

- KO*/Jv V h t * 5 0 0 3 6 (4, /<>WMF 
y-A4 3 3ftTV'*y9$1ti&$k4 3 4*«#&tt*« 

[0 1 3 23 03 7(4S^51^4 2 7«iaSr^UfcfcO 
t?*>9, NSr^te^fccoATJ^^^/HS^-r A73 

* b v-&frt>umm*miRtzii!mx'h5o r>'<y 

9tmi* t7n b94-7V9±/i>l . 0iJx.tf4 3 7 Sri 
j&1-5i:#(£fflv^ftS 0 tr^-tr/i^-r*>H4* 
bfrbmtbbtlZo 038(4, J^t4 3 6CAA 
9*-^*sS*SiR»4 2 7.Cj:o-C£&$ft, 7n|«? 
-f/lf^W^y h4 3 7*5^$^5^^^ 

[0133] s*ajR*sfTt>n5 <t , mmm^m.4 4 
o (032) asfrfcH*. 0 3 8 tt%mAwmm<Dm* 

Sr*Ufc *>©■<?, Wf-?i/^^4 4 1 td*&j*ft^F*V 
/c-^ffi-Cil^S^SrA^#^, «o J; o \mt)Wm 2 
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4 2 LT^5„ 0 3 2fc*5V*T. & 

2 5 2 6, 4 4 0cDtti7JteU— >XV yftik 
M4 4 4{CjglbH?), H3 9(C^$tlTV>5J; 5i^ 

[0 1 3 4] El 3 2tC*5^-t\ -yZfVkMA 4 

Kftia4 4 6JC^S8(Hj:, -v/l/fa-^ HIRAM2 5 

91rt&=.9 HM»*5*«-e*>S. El 4 0 f*. «» 
«yi4 4 6*rtf*'fcA'5 f — *»cafflbfc*?--e*>*>, * 

[0 13 5] 114 l ft, ^KtoaSr^y^^-f hf— 9 
iriiffl L-fc^-C&So 04 2f4, WArt»7*— T ? 
h3 8 3*»6W*^«B7*— 7y h 3 8 4^-^Sr^ 
^-t-Tif 2 4 9«?«iaSr*Lfct>OT?*) 
5„ roD&aT-li, 03 2f£^Lfcf£l&&ai^t8<£>*!i 
34 24, 425, 426, 4 4 0 SrUtfd*, ^34 5 
0T*f4M{C®^SiR4 5 1 , #jE$^4 5 2, /Mb 
TKU"»^4 5 3, #tii^^*>'?'4 5 4<0#yi 
*^A/-CV>5o 04 3lC*L;fcifS|i#8&8ia4 5 1 

«, si 3 7 vmmmvitikmvmtm-vh *) . ^-Wif 

<D^f-^^<D^m*>Hi^n, 7*-^«@ 4 5 6fC 

[0136] 1441:* L1t#jEM{k*miii. 0 3 4 f 
#Lfc^<y ^jE»ft«Ml4 2 1 <DISB?R#©»ffc€:*- 

ttaasfrttfta. 04 2©/M yui/^y$m4 5 

fi, * h!J -AT Kl^^cDft/h2 If y t *r 4 Zf * 
hy--MC^i-S„ h7Fl/-»^i4 5 3t? 

i±, ^ar^y^^f h^fflv^ft-cv^t^ (H4 

5) , hf+W'bffiOft^'W 



A^ffllMbHTV^S fc#fi (EI4 6) , /<-f hTK^ 
y^ti^a— yW4 5 3tttttfj* h !)-A©»S7 

[0137] si 4 2o*t a^^^*aa4 5 4^04 

-r-^/w (^Jx.fi4 6 0) fc^**-*" aJaaTfeS. Sffl 

*V?±;\'*—1f-f-'t-9 ! 7 ! —$'Wiftui?x# (po_d 
m r ) 

Cf-^JJff^i 5 ^^ (o o r dmr, ooc dm 

r) 

*tt*^*f-f-fU'9 , -*tlf£l'S>** (ro_dm 
r) 

1. ^HffoKlUfca^P-feyf u^^jc»tatr 

2. S,Sco^tcX<5^^■r=■7 , '^-fe ; /•^^g#T*^^$^^ 

* & t*cD ± 5 fc^-r 5 a^^i-s&asrtecD&a t 

i: fc < »T LT L* 5 £ £ 1 fc5„ ^i-Sx-^IE^ 
rsj try MSOT-fcfttf, ayntytuf-^ 

[0 13 8] l4 8ttf-^i^l^^^(?)7^-V7 
b£*Lfc0-Cfc5 o £TF<D^fi, 04 8 fc^Sftfc u 

[0 13 9] 
[S8A] 



*6: X-?Sf£L'VX$r7*-^h 



—j m U.L? 


Hun 

new 


I30 




is2 




I&1 




IsO 




suben 




n&ptkate 


II 1 ft < T f 1 In T* rtllB it H" it 


wrrnosfc 




cmsb 


0= #iE ttfc J&SexfrT-ewU-/ W hrow'N >-i bsaflu / \— 71 — 


normalize 


0= 1 e»KT-4*?5?x^h 
1 =2c !>hT — 7T/ VX-7h 

1 — k-^— *S T>7 V- 


bo 




P 




if 




IS8B] 






cc 




L 


O^JElV+HTr— * 

1 =*U: T-$^©#-f 1/* 


what 


1 = J' — <r»WU7 KUv-»y 
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9: Expected D»U Types 



[0 14 2] MftZtlZT*-??-!-? 
[0 14 3] 
t*9] 



[0144] t£ 



i " — — 1 

Instruction 


Operand A 


Operand B 


Operand C 


Result | 




(Pixd 


(Operand 


(Operand 


(Reside 




Organizer) 


Organizer B) 


Organizer C) 


Organizer) 




ps 


P* 


ps 


pxcn 


ps 


ub 




ps 




W(B) 


ub 




ub 


ub 












const 








GCSC 






mcsc 


mcsc 


mcsc 


mcsc 








ifl 




scsc 


scsc 


scsc 


scsc 












(B) 


(B) 


(B) 


(B) 






JPEG ESS 


PS 


pb 


el 


ct(B) 


et(B) 


ct(B) 


ub 


ps 




us 




CB) 












JPEG ffifi 


ps 


pb 


fdi 


fdt 


fdt 


fdt 


I* 


ps 






sdt 


sdt 


sdt 


sdt 




ub 








CB) 


(B) 


(B) 


<B) 






Data coding 


ps 


px 


el 


et 


ct 


ct 


px 


ps 


ub 


pb 


fd< 


fdt 


fdt 


fdt 


pb 


ub 






ub 


sdt 


sdt 


sdt 


sdt 


ub 










(B) 


(B) 


(B) 


<B) 








skd 


skd 


it<B) 


it(B) 


it(B) 


it(B) 


px 


PS 


Ikd 


Ikd 












ub 




ps 


px 


mm 


mm 


mm 


mm 


px 


P s 




ub 




w 




(B) 


(B) 




ub 




ps 


px 




P* 






px 


ps 




ub 


pb 


ub 


pb 






pb 


ub 






ub 




ub 






ub 








ox 










px 


ps 


181 


ub 


pb 










pb 


ub 




ub 










ub 






ps 


px 


PS 


px 






PX 


ps 


m 


ub 


pb 


ub 


pb 






pb 


ub 






ub 




ub 






lib 






ps 


px 










px 


ps 




ub 


pb 










pb 


ub 






ub 










ub 



























































Kio] 
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3c 10: i/>mV 





torn 


OS 


racked stream Sttfc A hi)— A) 


ob 


packedbytefi 9 £ tiitJU h) j 


ub 


unpacked bytes 0* V £ £*Tt^fr WW b) ! 


DX 


pixels 


bl 


blend (l&a ) 


odnsi 


coast an I (3£*&) 


tncsc 


4 cutout channel (AtiOl^^r^JV) 


6CSC 


1 outout channel color conversion table ( 1 fcHTjA ^-^fflft^ — jOM 


ifl 


Interval and Fraction tables (BBSSSr/ffl^^ — ~7 M 

1UIC1 VOJ A LIU M. IOUIUII laUluu VI Will IJ i*. V *-Mk*JJ J * f w f 


cl 


JPRO enrndina table (JPEG ftF^fll'T* — ^/JW 


fdt 


fed JPFO d^codimr table (j£x£ JPEG ffi-^ , 5 = — !lf JM 


sdt 


ojnw TPFfr decoding fable JPEG tK^T* — ^/iP) 


akd 


«;hnrt kernel descriptor flEV^A^^JPfBifi"?) 


Ikd 


ion<» kernel descriotnr (& V*^J •"^JHEiJfiT") 


mm 


mfltriv cn-eflficienl table {fr^fHlfeft'ir' — ~f)\A 




image tabic (H«x-7;W 


(B) 




<T) 









[0 14 6] 3. 16 T-fjEmtm® 

ny>ft x fc^yir/l^-tf-TMIf 2 4 6 i^^Kt 
— 'fl'Y'i "i^B, C24 7, 2 4 8^^— ^E^bgPl 
0 6 2, i-r— ^^2 4 2fc3V>HJ PEGSB2 4 1 
W^&y-y:?^ yy*:x:^v v #^-$!|^|f|52 3 5 f© 
7*n ^7 5 y^x-^xy H 0 6 4 tfoS, 
jfiftsSBl 0 6 2bf£:iyyyjry?X3L^i?>(DWift 
Ji, 7°n ^7 5 V^x— ^iV h 1 0 6 4^©#^^ h 
y-Al 0 6 4iC£^X&%.£hZ. fclfrfrrfcfc, 7" 
n/7 5 V?*--*?*. yH 064 tt«*«HISrff V\ 
Pff-f-1 0 6 7 & 1 0 6 Z^'s^T 



ny^KHJAti. ^ATJX-y 7- F 1 0 6 9 r t 
fc, jESfllWl 0 6 2«^^{tS<5V>Ty f -^<D7^- 

W77^y^iyi?yi 0 6 3IC&W1-3. 
[0 14 7] ^-y-IEfcftifStt, f88R(ctttf^-fe/wd— 
^-ffi:*^?^K*- c 3. 

9*-^*a§jK:jEafl:ufc«, jPEGfftftfca^tt 

W7 7-f y^i^y 1 0 6 3f4, 3 

2 t'y hw-k/uthzmmy*—?? y<D7 ! -$\zn 

LTftf£-r3o lot, jES^WWtA*^-^ *r 3 2 f 
y htr^-fe/U7*— vy b.Km&i-ZiUM&fiOo JEM 
ttto~*<DSj3?*-fi V- K 1 0 6 9 t> 3 2 tf y htt fr# 

^07*-7j/ h-C*>oTtJ;v\ ^y^S^A^J^h 
!/— Att. x-^^-^v 5 !^ hj^l, 2, 4, 8, 16 



^i^b^6^5 0 7y/^^/^ HATJ^ h I) 

-Att, 8 tf v hO'M hOD^S^T-fcS i 5 4 3 2 

t^-fe/WT 1 -^ 1 1 t^^^/^8 t-y Hi-e^a$ 

ix5i5 4l, 2, 3, 4j@<^$5 : --r;M^i><bj&6„ 

[0 14 8] 7*-?]Emimi 0 6 2<^A«: 

gB1 0 6 2tt, FIFO/%7r (FIFO) 107 
3, 3 2 fc'y hA7JV-^^ (REG 1) , 3 2 t*y h 
tiitl^^f (REG 2) , jQRfb^A^-T'U^lH 0 
7 5, fflWl 0 7 6H«§ o A^T'-^y-KlO 
6 9ttF I FO 1 0 7 SfclMASilfcft, (REG 1 ) 
1 0 7 4K1s<X<DAt>\?y h#Bfaffl*7*-vy h 

1 0 7 5ft, (REG 1 ) 1 0 7 4tf><D|l:i: (F I F 
O) 1 0 7 3»aW^t^P>Wtry h£8&1-5£t 
T\ R E G 2 tC 7 y ^ tT^ ■fe/l'Sr^J*-}- 3 <t 0 ft 

3 2m^-g-t>-B:^-r yf-^r«x.5„ gpib, JEMit^/^ 
7°W^lM 0 7 5fix [6 3. . 3 2] £ x [31. . 
0] t-C*$Jl5 2otf>3 2f y hA*7-Kl 0 7 
7, 1 0 7 8£A^-fS„ 

[0 14 9] r<D±5ft#i££fflv^<t-e, «Ftd^J^- 
J^ ^ a^c:*3V^TF I FO*y4< t^owW^x-y-? 

fcSV^i^-y^^y K^F I FO^y^rfolWtS 
££jJS&5#, A7jy^^'1 0 745;Mv^:i:-e i 
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^X%&£Ats?~9*ftmfctZ>Z.£ftXZ^ 
^IWeftkaaicjfcfto 1 4 5 S 6 4 5 latsgft^ 
fc*y M hV Z MM<D94 

■f<rym.f-9 v- KfliEJIMttWcAa £ns <£ 5 &» 

[ 0 1 5 0 ] Wm%tZ. R E G 1 10 74^REG2 
1 0 7 6£3E$ft3'f *-^/Wf-J§-REG 1_EN 1 
080^REG2_EN [3. . 0] 1 0 8 1 fc&jfc-f 
5£tt>K>FIFO 1 0 7 3 J ^IE^'fl:^/^7'W^ 
f-1 0 7 5*W»i-5«*Sr"b^«i-5. @4 9<D7°n 

5 yy^-i?x.^b i o 6 4\$7 : —?jE%ii\:Ui o 

6 2fc#LT«©±5*«J*«*fcS&lti-*-3. FIFO 
_WR4ff-§- % jE«fls7 7**-n [2. . 0] » tfy 
h^iryhb [2. . 0] , ft^*!>^hc 
[1. . 0] , b (E) tV^ofcff^-X? 

4 9frZt\CF I FO_WR{f-tl 0 8 5*3§W-fS- 

tti*), f i fo i o 7 3fc##a**i$ 0 mmm 

P>tt4^t#tif±, F I FOtef i f o_f ulltl 
77^1 0 8 eZmtti-fZo 3 2t'yhA*f-^^ 

5'?^M)-A7*-'7yh (E=l) T&S/^T^ 
y^/W l> (E=0) "CfcS^il^^tvSo E=ltf> 

SUNT** 4*. EP*>, n = 0ttl try HiOS*, n 
= lte2fc*y hH£*, n=2f±4try MB?*, n = 
3 ft 8 If y hKK& n > 3 IS 1 6 fc*y 

vy bfSiittUfcA**^*^ b©S**T?«fe*. 

Afr&icte, c = l ttft/jv<^>o**swSb-e*>5 tr* 

-fe/W, c = 2 fi*/h 2 b *s*»T»*> 5 f * -fe/K c 
= 3 ttft/hS/V h^#S6"CS>S f^-fe^K c = 0 lit 

[0 15 1] MJ-A#8 fc*y bWEHT^SKU 

frbf&ZW&Ktt. t*y btf-7-fey brt'REG liC&j$ 
$*Tt^5fir?&5x [3 1. . 0] f^x-^&SM 

JMWWMer*. try b*y±y h&KCtxDW* 
9*4 by [7. . 0] (DZ.&JjmmTtt-VS-Z.b 
n = 0 

y [i] =x [7-b] 0^ i ^7<Dk% 

n = 1 <DW&, 

y [i] =x [7-b] i = l, 3, 5, 7<Dt 

t 

y [ i] =x [6-b] i =0, 2, 4, 6©i: 

# 

n = 2 (7?i§£\ 



y [3] =x [7-b] 
y [2] =x [6-b] 
y [1] =x [6-b] 
y [0] =x [4-b] 
y [7] = y [3] 
y [6] =y [2] 
y [5] =y [1] 
y [4] =y [0] 
n = 3®#&, 

y [i] =x [i] Ogi g7©t# 

n>3<D®-a\ 

y [7. . . 0] =x [15... 8] 
tiljj7 ! -9'U by [1 5. . 8] , y [2 3. . 1 
6] , y [3 1. . 2 4] «r£/S1-*5SfcR«-C*><&. 
[0152] 4*3, gl±CD#i£tt\ h V -ACOg 

*&A#U &£4lalSjtotta^9&mMiiNi<z>lti£. 
^-y-^i^ h$r£fiW-5£ fC, v**»4**S*>UiAT 
l^f Sr t^riW"* r t 5 <fc 5 KfcSS-Ct 5 0 * 

^yf^yt^i^, 4*3, ±5£co^it?tt, 

£rfflv^4§£-|ctt, fy h^-7-fey hSrA^J/W hroft 

fc, A*s^ffl^«!)iai^ffi£i±<Dtt^tt, mtimm 

ttATJS^ 9 »r 5 , -«ic »1 a \? y 
bfcBMS^SifcfcioTa-jSSftS. JbStTftt, 16 
tfy b J f-9*7'i?~9 hwf^q h^il^-fS^ i 
K«fc*K 1 6tfy bA^®3H«r§J*)*TT8tfy hR« 

[0 15 3] i5 0©SJ|?Slin [2. . 0] ic 
[1. . 0] ©a^-Srtfv\ :ixf>H [2. . 0] i: 
Srffl v ^TiE^k•v/^^7 p 9 Wtztb (DS^ffi-^ R E 
Gl-^REG2Wfcfc^^-^/Hf#Sri^i- 5c * 
fc, F I FOfiifHS-* iz&^x&icftzz bhhZTz 

f y hfitfi i n_.b i t [4. . 0] fc, WTJx-^O 
##)i^Sr^5^^W h^Bo u t_b y t e 
[4. . 0] SrlSiti-5*»»^tfl|*5 D WfPAM:. 
M^HTlfcttA-e, i n_b i t [4. . 0] ©ffi 
tREGlcOftfctotT'i^y- b<0figt SrJt«-f5 r. i: 
X-XtiV- K«r*W F I F 0#sg*C4v> 1 * a y ^ 
tfvf */Hc*i^-CF I FO_RDff-^Sragffl1-5C<!:-C 
F I FOR*at»f^S:H»1-5. «*f i f o_em 
p t y, f i f o_f u 1 1 WtF I FO)Ri77m 
9, F I FO##$47*-*£;£l.-CWj:^£ttc f i 
fo_empty=l, FI F O/S^yKD £ # tC f i f 
o_ful l = ltft5. F I FO_RD^'Stti$^fc 
-y^^^yKci*3V>T, REG l_ENOiittl$ 
ffU^— ^*5REG UC®«?ji*tb5. REG2 
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[1. . 0] , REGlrt©»»WI'f'©*S*»*fc, 
REG 2{C*5V^*{£ffl5 1 ^^/^0 3oOfi^T-<Dft 

/hffiSrtSCtT?, REG 2 E N [3. . 0] £frg 

•f$o E = 0W#^-tC(4, REGltiai-oWtM 

£ftfcc [3. . 0] t^LV^fc, JS£*H«j!7- 

b t rune [2. . 



10154] *%w<n#mti:nMmx'te, mmukiEm. 

[0 15 5] 



0] = 



[2. . . 0] 
[2. . . 1] 
[2] &" 0 0" 



n>3 0Wrfr 
n = 1 CD4|£- 



( r&j \i\£> y hZb<Dm&im*7*'f) 

iroi 5 ft^SC: J; 9 , 11 5 Otr&l^TMUX o, MU 

xi. . . mux 3 i-?^&i%x^%&jEMit-?/^y° 

U^tOf'f^i, fflmmffe%Ffti'^^kZ<D3 2 - 1 
0-l£-«Sc$:ft,5. r©^XSB/hJcJ:9lHlKa« 

[oi5 6] gu^stc, am^mmx^ ^-9 
s. 

3. 17 T*-fe5l^*#-K©HfM&«»fe 
H2 £S2tc*5i^-r\ i^$fflgB2 3 Sfi^^n-fei/-^ 
2 2 4 Kj8V^TllfirSn5»fP»t»**H*#^Sr rH 

if. r ft e> co^rffl 1 ^>i>^f&9imx'h 3. 

[0157] 3. 17. 1 
1215 11*, ±x-^/^SP2 4 2|C*3^T*^^ix5^ 
jfc^fVufc^LfcBTfc*. -g-^X/V4 6 2tt, -« 
iC3ocDx-*A#y-*<fcti^x-* (^V*) 4 6 
3^if. A*y-^«7)lo|* x tb^j4 6 3t^*yrt 
X(DmCftttfrb<D¥?±sUf : —? 4 6 4-C&S. * 

wt^7y K4 6 5*-&tp. r:-?, fe^awaett 

7?yK 7UXK» k^-fe/K fi'fA><D£tn*h&. 



7 , l/yKM§li4 6 7fc*SV*-ta=J*3;h.5. 
Si-, AJjf—fitt* 6 5£j$«£ 

■tfSBSf*-* 4 6 6 Srt^tf. 
[0 15 8] ffiJfc©J:5fc\ iI»fc^-fc/)^-*f*S-^ 
■f^Atf* 1 hi|6 _ efc5 4oW^-r^/V^^)^5„ S. 

^«Kg©»#^ffltt{CB8UTtt % ATOM 
X fThomas Porter and Tom D 
uf f" Compositing Digital I 
mages" in Computer Graphic 
s, volume 18, number 3, July 

1 9 8 4 J fc4f©«!MB*«:#J»$*l,fcV\, 
[0 15 9] 37'o-feyf-tt7'm|t9 f -^^V>5i 

frk&lfihiXmW-lrZ&mxh&o ^©fc&, 2o© 
^>3^wSSIiC4 6 8, 4 6 9^ 
fcSfc, ^WRf-r^A 7 0, 4 7 1 
TVfsHU ^1^9*^*4 7 2, 4 7 3tr*« 
-&J&8&4 7 5 fit, St&Wlfo^-* ICS 



[0 16 0] 
[0 16 1] 
[SI 1] 



K1T©3< 1 1 (C % 
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mm 


(a^a^overfl^.lio) 




( a oo- a o> in ( b co' b o) 




(a^a,,) outib^bo) 


W w oW H o)) 


( a co' a o) alo P0'co- b o) 


(«co b o +b ca(i^ b o) 






( a co»^o) P ,us ( b CO' b t>) 


( wc ( a co +b co" I < a o* b o' 








255)^5,clainp(a a +b 0 )) 


(a^,,^) loadzcroOJco.bo) 


(0,0) 






(a^.^) laado (b C0 ,b o ) 


(aco,b 0 ) 


1 (a CO ' a o) loadco OwM 





[0162] (aco,'«o) tt,fiicfc!F 

fi-Cife9, Twcj tt^T-eaiH-f-S 7 v Vs>f/? y 



t>, ^7^/7 yfVj/«P4 7 6f4, $1JM0~2 5 
T*fc5„ ^gT-fottii, T^Sr^v'a VCD 

[0 16 3] 0 5 2 {4, -g-$&yi<HT 

-A'KHU ^-C0x-^^ M-.A4 6 4 (115 1) =1: 

Pb7-i'-^Kfi^2(Oy ? — MJ-A4 6 5 fcTV 
5 ^&T7F L, Pi7^ — Kl4g|5& 4 7 7 

C7-f-^K(ifil0-2 5 Srtlcys/^&Sv^?? 

d»£r^U r C om— codej 7^ — /l^KHf (Oir^- 

YW^t (mdp_por) Sr^-Sd^ti-CtS, 

T'fi, com-code7w — /V KI4^7ir s> h \s*J* 

[0164] ftiCi^/cIS 1 0<D^Jp^!7- Mff^ft 



KA»^9 V KB b b t>t^? y KBSri 

[0 16 5] 0 5 3 {4, -fW K#^eo^!7— K7* 
-vy hif-«4 7 0 t Ut*UT^5. "fW Ktt 
Sf4> «-^^*A'rtOil*6f[4 7 1 fc»Tffi4 7 2 t 
PJt&Kx H54Ht» ^7K^4 7 
6, Httt7tyh4 7 7, ft£ 4 7 8 \c£oXMfe£ 
fl%># 4 }^3t7 V h£^L-CV5 0 t^t©? 

^&3J4*^ 7— iftfCfTibtU 055(413 5407^ 
-^K4 7 6~4 7 8^t5@ffc5„ M^7F 
i/^4 7 6(4^ -T/u^^ey cogger Ku-^Sr, ^-f^M 
h4 7 7{4^>f/l'§8^Kffll>f>n5ftD30 
/V h£\ 24 j\/&4 7 8\i.y V7'1r%£.fc<Dtr4 

[0 1 6 6 ] m 5 1 {c&i^-C 
14 6 6{C,fco"C«Sl$-fre)iv53:tt<b5. tt£fltfi 

1. ^(0^-'<7>-KC!7-Ktflr®«7r^^Srv^ 

2. 1^4">"C, 0iS*7f&5t> FvyyiStt, 
WW^7V KC17— K^-Ctf 5/ h?i/7'©7 Kl/^ 

3. hTy7H*. ^©*-<5XKC7-KO 

4. h^TSrffl^T, KOirtK*^ 2 

So 

[0167] JfcSHIttflF^fc L<7) 0 ~ 2 5 5 (OSSctfc 
C o a = C o a X A/2 5 5 
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[0 16 8] 3. 17. 2 fe^ffi^^ 
02 iS2tfe^t, £.7 ! -?/<xU2 4 2tx-^=¥ 
•Y^-ya.2 3 0(4. ±tefeSMfc©«ui*ff5. 

7*—*V V (^fiCYMfcSWSCYMKPPBlJtCljS 

Stt1--<T^fe^^f-^- <fc 5 Jc^H-$tlTiJ 

5^i:^t?#S^ft!lffll§i52 3 5I4CBU s 2 3 1 
LT, ±5*— *>*;*»2 4 2, f*— aJMMS 
2 4 0, A7J'('^^7^-^^'l'-/^2 5 2, f^-fe/W 
■af-— 4" if 2 4 6, MUV^y77 2 5 0, ^r^-yV 

M—tfTMif B 2 4 7, d-^^Ksh- It-tJVc 2 

4 8, 8**- #7-^2 4 9«:«J*U fe*!ft*-K 

LT±7'-^/N , ^gI5 2 4 2(C 1 t^-fe/V-y^ vrtlCii 
W£ft5o ±f-?/<^Sl!2 4 2 (02) ttA^-TV^ 
7i-^^ s/^2 5 2^^fcf^ir/^— #7M if 2 4 

^^MUV/^y77 2 5 0 fCfcP>^C£>n— KS 
ft, ■fe^^^-^^^y^ i 2 3 0lCO-K$tV 

5. if-?^ 2 4 2 \tzivh<Dm^^y -y YOr— 

fflfr h C YMfe 5 V M4 C YMKfiS^C tT^ i?/v£^& 
U ^^ft/ct^ir^fc^^tf^if 2 4 9£j£ 

•5. i7 f -^^^gC2 4 2, f-?^r>a230, 

7-^0JW2 4O > ffi©lffl*©fW:*tt* *^WW 
§152 3 5<£4lJ#PO-b£-C, *-W73-^feSPfl (SOG 
CS) *»*-K&6^tt«*W#-*ltesSW (MOG 

cs) £&^e- VnZhbfro*:- K-cidf^-fSo 

^^^ S /^a®^lgl5 2 4 0 J ^'r-^^-Vy^3.2 3 0<D 

nm\c m lt a, r^- y- * ^ y ^ ~mmn t**?^ 

a J 2 4 0, 2 30 (02) <DJf i£r#Ji$tlfcV\ 
[0 16 9] ]Ett«&SmKfft^H:«lt«^jM^«Ui 
-Cfc5 0 03;il4\ RGBfyir/i^kCYMK&SRflcD 

mx-t 5 mi?** * fc*»*-c#*»tt c x t * 

(It r^y? , H) ^<Dfe£lig£&*a i 8lc:fc^-c i b 
!*]«?*><&. BP*>, #feR*fc*5VvC£CTL*5#ifc 

ens. z<DXoftmmft&m&!m<D&im&<Ditib 

0»J;tfi 2 4 tfy bcDRGBf ^-fe 



YMK££W<£>8 ITy h^fegig (~>T» (C^ytTV 
?1rZ>j\>yirT y^r— ^Mtl HsJUbfc& 
Hi-TSo PtBKx 2 4 tTy bRGBtr^-fe/WSrCYM 

-*UC*fL-T, i^-^/^2 4 2 
(0 2) tt N 7-^t«;^2 3 0KltMft**t,&/l/y 
^Ty7'r-^Srfflv\ A2jfe£W«f»0jS;fcfiv>tii;& 

a. *-W*-jR6ffiM (SOGCS) 3K**-K 
*-*t>tF^«ISW7J^,3E«l^-K (SOGCS) 
(MOGCS) ^tt>, RGBfe^F^(48 t*y 

& WfeS^Sr*-f 5 2 4 f y M**-te/n&»fc|fc 
5 0 RGBfe^(D#RGBft%lil 5©EIWfc:$HM3 
It, ^n^*1/<DEra<D*$»4^W^©RGB^tjCY 

«-W77yy v^wfe^M5rElt{cia^5^ 

A^t^^-fe/KD^-fef-Y^^w t fc, feg^|t(D15<0 
KWfrottltiSfti&fcH*. JfOKfflfcA/jfcSJMIM* 

— ^^^§15 2 4 2iCt5VNTfflV>f)ix5o 'b* J 5^, 

[0 17 0] ftl5*cDj;5(ilRGB©^7cf4l 5©KF^ 

3^7cyrw^fll3gt^o-C^9, EM<OnEJ«OA73 f 
^■ir;W4A^^^-C(4mv>iHgt^oTV>5 0 Kfc, 
EMwi^«tC^1-5tli7J^McotiS73fefflO^^;Wy 

^-fe/KDUJ^fefiSl±, A* t:*^ -fe/^5#fi-f 5 Eraopi 

[0 17 1] 05 6\t, A^RGBfet7t/H;»L 
T, »tS-f5KM^ , KIBrt«offifllSrft*1-*«4 8 0^ 

*fC^5. ^^8(4, 2 4 t*y h A^l t^-fc/KD 8 
fc'y hA*fe7 t +^/i'ri:^Erar-7'^4 8 2^EW 
rt(ill7 1 -^/V'4 8 3 ^fflV^T^^T$^^^.. 0 5 6tC*5 
^T. 8t*y hA*feK*4 8 lttl 0it»O4Sr/V 
^-y-^"C«*U*:t>0-C*<5*s, dW8 tTy hA73 
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feg^4 8 i #Effi7 l -://w^EMrtffiB5 l -?VV'^> 

l±, A^Jfe^^4 8 1 tf®tet% Qfab 14if©E 

m<Di^A\fy hT-w^-rSo hju-ikc, eki*^-^ 

/W4 8 2(4, A7jfiffigl!?4 8 l^#ffi-TSEPr5rt-C<© 
ftB£^-t-„ EFflfix— ://H4, 0/4>6> 2 5 5£-?©iS 
B©8 tfy h{6£iyrtLT:i3'9, C©«42 5 6©#$C 

tuT^§n5. iio-c, i omm4^^-r y--e 

£>P>fcLfcA7l6Egiii4 8 l©»£-tcf4, Effi-r-^ 
/W4 8 2£/UyyTy7 p -f5;i tT\ W7JffiO#£jiSc£ 
*v£„ *fc, ATJ^fcEF^fttt^-^/l^ 8 3-e/^ 
y7T•;/:/-?-5;:,^(c4 <? , »»1 6 0/2 5 
-rm^Hl 6 0^4^$ix5„ EH8x-://W 8 2i:E 
BSrtftgx-:/^ 8 3 a>6>frri>£ 4 5 (c, EFpSjft(4^ 

— e»*v\ mri£©4 5(c, ErafitteaH»<o#i»jg 

t£tc4o-Cfc©£>;}t5„ 
[0 17 2] ±i£©il!>, ^-RGBfeg*{C*fLTEFBl 

©KMB*i:3o©EIBrtffl:«HJ**s»?>H*. £6S 
ifi(c*f-f 5 EFffl/EfcWftB-r-^/WiMU y7r 
(02) {Cn—K^H, ®4ttti7 f -^^X2 4 
2(C4oTTy-fc;*£tL5 0 fi^gW&SKl&ltSMUV 
^*y7r 2 5 O©fflM£Sr05 7»C^"fo MUV/<y77 
2 50 (05 7) (4, tll«1#«HI#lC#JW-63 
t emp 1 1 =CV (P 0) + f 
t emp 1 2=CV (P 2) + f 
tempi 3 =CV (P 4) + f 
t emp 1 4=CV (P 6) + f 
&(C, fflHWkmt, ElT©^fflV^t emp 2 1, t 
emp2 2£#«>, ft (G) TjIrI© 1 ^7cMW©H-©Sr 
temp21=templl+f 
t e mp 2 2 = t e mp 1 3 + f 

f ina l=temp21 + f 
A7J t H73 1 (Dffimft-Wt. L4^*&t> U4 Ltf* 0 ft 
5. CC:T\ W*ttHi9 J A^«fflJ;9'b»v^, ffiffi-e 

05 9(4, £<DlSIH#£C$W«r 
ftH 9 , A7jfSffl{££ffi7Jf5iaffi(C 1 i^TC^ y t° 

W;tJt«*jft5 1 0 b 5 1 1 i: -C£*oTV5 

5. ^A©tU73«^,^5 i 2"?y y^T^ftStfS 

,»;, ,^5 1 U4;i©:fct£©l±i7J^fttt*ufftbfti/\, 

t£oT, 5 1 Ob 5 1 l©2o©££*tFA-T 

14, US 1 5#*W»ift>J, A7J/S5 1 6fCI4W73lt 



o©^*j£4 8 8, 4 8 9, 4 9 0 K#»7 f>ft5„ &S8*Sc 
(#J;tl44 8 8) (4, SIC 4 tTy h©EW^-^i: 8 
try h©EMrt&fix-:/A'<fcK#l7'b*i3. 1 2 tf y 
^1734 9 2f4£T ? -y-^*SB2 4 2(c4oTMUV^ 
y7r 2 5 0*»fc#A;fcfe^*^iKBUtt$*i 
5„ 1 0il^4©^-A73-fe®^©±^J-Ctt, 12f 
y h Hj7J(4 00000 1 0 1 0000t 

[0173] 0 5 8(4, miw*aa©««r*Lfce!-e*> 

5. ffiFfl«!iSI4, lo©3&jt£ffi5 0 0 (#J;it4RG 
Bfe^fffl) *»e>te©feSIH (WilJCMYji)5V^CM 
YK) ^©MHtf s £&&a-e&£o f^-tr/WP O^P 
7t4RGBA73€,^ra^-Cffi<#ftL-C*i'9, ffl^ftffl 

Ktfcv^^tstu^felcv (po) a»e>cv (p 

7) Sr^-fS. tfy-fe/vp 0*»bP 7©BBK:{fcg1-5A 
73^^-fe^P i ©ffl7j£JRf§ttf4, «T©45ICLT* 
*"f, ATJfy-fe/WP i £®?>lltpEF H '3©PJ 

SPO, PI P7Sr8tSi-5. ft(c, E^f*J 

fedH^ frac_r, frac_g, f r a c _ b £r 

lev (po) hcv (P7) <Dm&&.m?mmmm 

[0 17 4] ffimtm\±, (R) 7Jr6l©1^7cffi 

FsJ£1fV\ tempi 1, temp 12, tempi 
3, t emp 1 4©fit££lT©3><b#©5. 

rac_r (CV (PI) -CV (P0) ) 

r a c_r (CV (P3) -CV (P2) ) 

rac_r (CV (P5) -CV (P4) ) 

rac_r (CV (P7) -CV (P6) ) 

[0 17 5] 

rac g (templ2 — tempi 1) 

rac g ( t e mp 1 4 — t e mp 1 3) 

(B) ^©ft&jfcTcMIBJSrfi^. 

rag b ( t e m p 2 2 — t e m p 2 1 ) 

1 0 b 5 1 8 b ©ffiFffli&lf4, A7J/& 5 1 6 (C*f LTttJTJ 
115 1 9££j&-f 5. £©4 5&2o©ffl;>jffi5 1 7 i 

5 1 9©ngf4, mct&m<nf&&it*) <D&*mn-zm 

if L 14 Uf g (Co < T>f * £ #5 , r OlHfl&Bt? 
5fc©(c, ±7*-^^x»tt, &SItii7j££F B lT*frfl 

-rzzbhfl&thZo 

[0 17 6] 



0 x^6 3©££ 

out= 2 (x-64) 64^x^l9 1©tf 

058tC*5V>-C, ttMfeStt, RGBt'y-fe^Sr^tb 5 TjfeS^ 
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it, miO O^k^-fe^s^tvmSSSrtdfe^m^tlS 

J^a<D80(D{fi (P0-P7) £?Siip-< jioitsr t# 

[0 17 7] E15 7(c:*5^^-ci^.BJLfca , 9s 
*SB2 4 211, #feAAft^rtC4lf7 MZKSB 
fti 8 tfy hEfflrtteggMtfc^fj^S 1 2 try MHA 
5r©Umi- 0 iv f -^/^$P2 4 2f4#, ft, Kffc** 
4 try hERBlBft U 06O4 J <O5 2OO 
±5C*-012lfyh7K^ (IR, I G, IB) 
B 8 0 ft, 1-12 If y h7K^5 2 0 



?7D-i"Cfc5„ 12fyF7Kl/^5 2 0B > Sf 
4J&W188 1 (01 4 l) Oi5 4f f -i'*ts''>> 
mm&2 4 o<z>r Ku*£jMMc»e>*b. ;x*y^y 

(BO, Bl B7) (C*t1-£ 8|@(D9 tfy h 

7^f h 7 K L'^ 5 2 1 5:4^t 5, 7*— 

yv'a. (02) (i, 8ffi05&:&tf>^ y /<Vy 5 2 2 fc 
#S>J£*t, **v€ ? *tti8fla<D7'i' bT b'lsxic 

1 2 t'y hT 2 0A>b8 7^//W hT 

[0 17 8] SOGCS*- K{C*5»t5T KU-^-a-J* 

[0179] 

[S12A] 



* 12: SOGCSt-HCTKl^XWfiE 





1 Bit (8:6] 


Bit [5:3] 


Bit (2:01 


Bank 7 


W3:ll 


Gf3:11 


BJ3:l? 


Bank 6 


Rf3:ll 


Gf3:ll 


Bf3:n+Bf0] 


Bank 5 


Rf3:ll 


Gf3:l]+Gf0] 


B[3:l] 


Bank 4 


W3:ll 


G[3:L]4GrO] 


Br3:l]+Br01 


Bank 3 


R[3:I1+R[01 


G[3:l] 


B[3:ll 


Bank 2 


R[3:I]+R]0] 


G{3:ll 


BJ3.11+BKJ1 I 


Bank 1 


Rf3:lHR[01 


Gf3:il4<jr01 


Bf3:ll 1 


BankO 


R(3;l|+Rf01 


013:11^01 


B[3:ll+B(01 



— — — — - » — - — — w — • — J l ^ ■■■■»• L. v • 

3] , B I T [2 : 0] », -^H^g tTy M^yT 
K^C6H8iryK 3/5>e>5t'yK 0^f)2tf 
y R[3:l], G[3:l], B 

[3 : 1] til 2 t'y b7 2 0©4 t*y HERB 

IR, IG, I B(0JBli»&»3 bfy h£T£*-f 0 ^ 

1 2<0^*y/^V^ 51CPL-C, 1 2t*y h^F,9lfy 
h^©^y fcTVy'SrffJaiCRWi"*. 1 2 t'y hT 
*5 2 0tO4 tfy h#EW I r01~3 t'y If 
y KT Kl^B 5<D6~8 t'y hlCVy tfVy*$tl, 4 
t'y hJftKIR] I gWl~3 t'y h&$aW£1?lX9 f y h 
7KU*B5<D3~5t';y H^y tVy*$tk 4 f y 
h#K^I b<Dl~3 t'y M s 9 t'y t-7 Fl^B 5© 
0~2 t'y hlC-ry t'Vy*^tt5 0 

[0 18 1] 8 007-f V/^-T hT K^5 2 1 11, 
5 1 2 X8 t'y hi^JsKS^i-Sj**!)^^^ 5 2 2 
^C7K^tttffl^5>ji, Mt58fy hfflM 
£$5 2 3^#/*y/^y 5 2 2A»b9 yf 
^<DTKt->^^aiCitLtf, ffi^P0~P7|:« 
"t3tti7j£,MCV (PO) ~CV (P7) 
^tWWJTK^iftiiiWS. «ttf, l 

2 t'y h7 b'UX 0 0 0 0 0000 000011,1" 

^t^'y^tooo ooo o o o t ^ 5 H— <*v< 

y^T KextfftbJlSJP. 1 2 t'y h7 FM0 0 0 

0 o o o o o o o lorns-iax '<y* 7, 5, 3, 

1 Xit^V^ 7KW0O0 000000i*9, /< 
y?6, 4, 2, Ofll^^yT Kl^O 0 0 0 0 0 



5„ r«0<fc5tcLT, A#t°?-fe:/MI£l&»)iatp8ocD 

^-aj^&fltcv (po) ~cv (P7) tmmz.&* 

[0 18 2] El 6 ltt, fe^m*- K^fc^TfflVN 
btl5f-^*t5'i'a 2 3 0<Dt* y ^VycDfll^Sr 
^UTV^. ^*y/'y^«1 2 8 7^yxyMJi> 
e>/£9x #7^V^yhyf43 2t'y hft'4X8f5' 
h^^y 5 3 3~5 3 6*»b«J*$*tS. ^t!)7h> 
^5 2 1<D±7 f y bit, 7 KW'^tpW^-f-S 

9*-^5!fcftjeu t^V'Ottiitib Lt7^f5 4 
2-f5fcfe{cfflVNf,tv5>o T2 t'y M4/VhT 
t?, ■7/P^-7 e U' ; ?-y-5 4 3 — (D i: "9 , t*(7>4X8 
t'y hcx^ h y £tti7J<b LT^S?5 4 4i-**»«fcjfrf 

g|52 4 2tC^t>tbS,, SP*>, ¥—9**ry !s s=.WVm\t 
t^7VK^^f24 8 (02) Hl2t'/h 
hT K^*S»t»9, ±7*-#^»2 4 2IC 

*5»t5ttra^awfc*(D8 t'y ha^fettsr*^?^ k 

Jj— 2 4 7, 2 4 8tCtii^-r5 0 

[0 18 3] 06 0(c:jo^^•C, £7*-yvU*G2 4 2 

(02) immi&m* 3 xt- ?7-?mfii-z>, ±t—? 

(«»Jx.«5 5 0) tt^-f-S^^ey^Vy Wx.t£5 2 
2) A>e>HJ^*^<5fe|li:*KMffiB5#5 5 1 £A7J 
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Ul^i'^BA (HjL«5 5 3, 5 5 
4) 1^2^7-^5 5 61:1^ frac_gA^ 
5 5 7^ffl^T^2^xy7 P (DfiflS;^oTt±J^5 5 8 
ftfcfc, *2^^H1*B 5 8, 5 5 9 
ifrt c_bA^5 6 2 ir&ffl^T, 1»SWcX-3Vvr 
Si^til^fe 5 6 3 Srff^-TSo 
[0 18 4] 0 6 OK^Utjffilll, ^:*-eg:^:<7)^;W 

fc, 0 6 OW#fttt¥-tt5^feg^5 6 3#&K* 

ir, ^rtt^roW^fe^JffiS:/^^ £: ft-5 <fc 5 ft 4^*l=p 
b. fMfcfflrt-ISteaW*- K 

a^Dtj»t2 2 4liMOGCS*- K"C«)|!l^fetT5 
*, MOGCS^— KteVKo*W/!j.£|&^-CSOCG 
S*— KiJSffBHIIKftfW*. MOGCS^e-Kt? 
fi, 0 2W±T r -yV<.Xg|52 4 2, ^^Mryfafl 

#?gB2 4 0, ^-^^^s/^^teULT, mxz*i& 

y -^2 30 cDIM XiS 4flF*5.Rt 45*, SBtttM 
*«r«#Srt"<5fci?>JCMOGCS»fP*-K"Ctt, 7 s - ^ 

y 5/aMm« 2 4 0 ttffiAfeSW©-*"'<"C©a*fe 
«toi/4©*t»«M-*. ttit)&£$l<Dm CDW^fe 

ttiSft*. ftfc\ ^y^a.^*^ 
i^tttfjfiHttiffBro t°y -fc/Hafctvc h m Cleft 5 Sfe^ 

[0 18 5] 06 2(4, a^o-feyf-iJflHR^Y^/W* 

ffi (05 6) *ltt»OJ:5K****U I r, Ig, 1 
b 5 7 0 tl/^ofc 3 0(7)4 try MXIHjSMfkttSo 

Sttfci 2 try mr.5 7 oii«ra©si 2«otfi 

(W*ff5 7 2) H:06 3(r*3V^T£JlT-?KKt5«fc5 
cffvytv^^, *t&-f%**y/*>t 5 7 3^ 

yy7y7°LT4 0£)feffl;*)x + *^5 7 4**6*1 
5 0 5 7 3tt, £#-?5 1 2 X 3 2 try h 

*©5*><£>1 2 8X3 2tfy h 
by £*&*lW?>. /<>y 5 7 3f4r-**-lr 
y->3.2 3 0©— SB&ftU 0 6 3 Vftm-f i> £■ 0 t^f 



[0 18 6] 06 3(4, 9 fc*y h'<>9*J}5 7 8*5 

7 9lC|?^y tf>-^§tbS^-f-Sr*L-C*3»), tT-yh5 

8 0~5 8 2<7)|lB#^A*l#X.5^ t{C < t9^ ; ey^^ 

^*U7K^5 79lt **vm* 32fyhffll2 

8^vhy^t>j*s^-f.5^*y/<v^ Wx.(i5 8 

5) — c73TK^i:bTfflV^bn5. 7\fyh7^>T 
VUXfrRi^T**}) 5 8 5lcTyir*-f 5- tX\ * 

fc-^'J/^y- (Mti5 8 5) fit, W?W2t^ 

hoi 2 8^^hy*6^5Mii^^7< ; ey^*ri-?>o 
7 try b^vr Ku-^ri, reo^yv^-y 5 8 7 t><p 

^5 7 9<7>ft*:2 t:*s> HSr^^^^y 5 8 7 f ©»)S 

^£ftTV>3fl^5***5££*l5. vLC09 tfy YT 
Kv^t©t*2iryHt jfrifc^-^KMoirfctr 
yMc#J6"i-* (S12#I) „ t£oT, MOGCSt 

<fc < 4 t*PSK##J $ n» 9 b" y b T K ^ 2 fc* y 
h!j«RGBA*feKMf o«K«r»€-*-5ifctft5. 
BP*>, 2o<DfcTy H*y*fcJ;oT»j£$;ftfc4o©*IS 

^ ^<D£y- y*«K»«?i-*fettl*||[tta^fesEIB-e»K 

-cftBi-srtfcft*), ^y/^-vco;^ yr*£ 

[0 18 7] 200tTy h 9 y*#-9:LttV^«£-Ktt, 

x-y- * -Y- y fattflPWfi*-* y 5 * £IB»U ^ 
Sft ^ * y gg^ttl U*^r^ y *s*.fry 9 T y T'ftia 1 1 

hicT-?*^ v~>~m®%t£^xm®£iriz> 0 ft 
h y ti^-rsy-Y ^o-f^xomwnt* v *»feK 

5„ i6 3C»j(!li5 7 5|il6 2(D#^ty^ (^J 
x:(45 7 3) rtirUff^HSfc*, *ty^art*K 
iottt^tU^Ji^i* (^xli5 8 6) *W^j 
$ti5*-t?(-B*Pfl* s ^Stft.5^it)fc5o f-^5 8 
6(08003 2 Ify h±V HI, iO«±7*-^/'«^SS 

(242) ±m<Dtinsitm me 2) <d$ 

775-7 5 9 0-5 9 2*1-^<T©fe7^^/H^Bf(C^ 
5 4 O<0-fet## 5 9 5 

[0188] nm\z±M -iBLfatmmcisvz** 

yi/aO^^fJ&t'O. 0 1^^60. 0 3c7)tV-fe 
;ur'i0^t5'->'a7'( , y7ij'ftfe5©f > 06 2 
1 0 6 3 fcdo^T^ Lfc ^AtRflJ^^Sb-Cfc 5 r 
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[0 18 9] 3/nt^ti5|r5 20©^^- 



K (HI 0) ■C©ft^#*flstt»T©«3gSrWT3« 

[0 19 0] 
[SI 2 B] 





mm 






Operand A 


soarce pixels 


pixels 


packed stream 


Operand B 


multi output channel color 
conversion tables 


other 


rauki channel esc 
tables 


Operand C 


Interval and Fraction Tables 




I&F table format 


Result 


pixels 


pixels 


packed stream 




bytes 


unpacked bytes 


unpacked bytes, 
packed stream 



[0 19 1] IH 6 4(4, 6ffiBB***^-|C*JJt5*^7 



[0 19 2] 
[$13] 



$ 13: €&SWI£»Jfi*fc:*tt*T'f^— tf^a-Fx^o-x-f^ 





UK 


lrans[3:0] 




M 


0 = V • ti 7— 7— 7)1 7<-'7yh 
1= -^J^^v^U ■ i&—?—'fJlfJ*—'*<9 h 



[0 19 3] i6 5lt MOGCS^e- K^*5V>T, R 

fc^LTV^o ^TyT'S 1 Jd*5V>Tx 2 4 fcTy h RG 
Bt°^-f/U^ h!J— A^tT^-fe/Vah— iS-T^-f 2 4 6 
(H2) ^A^J^ixSo xj-yfs 2-Cf4 x E5 6<kig 
5 7T?KWUfcJ:5K:, f^-fc/K*— if 2 4 6** 
/Wy?7y:/x-7/W£ffl^T£A;ftB^©4fc* y MS 

2tf'A7Jf*-ir/KD# > St, *fe®m©4(fs; 
ISp-LT, 1 2 t> hT K^17-K*rftJ*U -©1 

2 t*y hT Ki'*?-^?*— **+y*>A«1lP»2 4 
0 (H12) \zmZ>o ^"ryfS4V\X $12tEl6 2 
fcte^TRW Lfc J; 0 T-^^fty VaWflPSS 2 4 
0))i;(Ol 2 f y NT K£80<D9 tfy H7" 

ttt/JltCV (P0) -CV (P7) (D^^D^y? 57 

3 (0 6 2) •foffiSSr*"*". **y^S r- 
^*tyVa((»»2 4 0 (02) ^80(D9^7h7 

Ku*£\ 06 3Tift9!LfcJ;5 {cff^s/tV^-fS,, 
r©J:$fcfC #fc»0>4 ify h KM©** try h 



5. 

[0 19 4] ^^i/T^S 9'— 
$52 4 03*9 M y YT KU*<£>ft*:2 fc*s> ht, 
5 8 7 (0 6 3) (p<0 2 \f y Y # ? t*\m.-f&» 2 k* 
y h^^9lf? hT Kw^coft^2 fc*s> h£-gtL& 

ttfttt, tttfjfittcv (po) -cv (P7) tt*+y 

->3.^!) 2 3 0lr#£L&V\ ?tot^7 3'/S 7fc 

SM^-y^bf*— 2 3 0 lCSg^&**l5o 

2 fc*y b^/i*9 Ify hT K^O**2 Ify hi— & 

fejlcv (P0) -cv (P7) $r»9ffi-f-. r©J:5 

(P0) -CV (P7) ^±x-^/N°^g|52 4 2{ii-o 
Tf-^*tJ''/a2 3 0/i^Sl9ai*tV?)o ^yT" 
S7-CJ4, X?y?S 2 tHzE.mftVtM&R^ 
TttlTJfeftCV (P0) -CV (P7) /J*±y*-*/^ 

SP2 4 2fc:fc^Tin?s£*^ wiasfifcta^feit^tti^ 

[0 19 5] ^rt?, RGBfiaiB*>f«?1-5aAfi« 
Sr4«R£H±, «Jt«3 2^ny^|c3HC»tW6rfc 
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fc, MOGCS*- K-Cfflv^nSx— 

[0196] 3. 17. 3 JPEG»fft/If 

«lp<Olo«SjPEG«Jp-C*>5i6S, J PEGTOdM 
i"5S£*ffl *iftWttV a n Nostrand Rein 
h o 1 1 9 9 3#|CWHg§tbfcP e n n e b a 

keriMitchel 1 I' J; SSfc&fc* fj PEG : 
Still Image Data Compress 
ion Standard] £#JS£tufcl-\ ) aT'nt 
y1r2 2 4 14 J PEG^JpWf-^-fes/ hSrfflV^MIMr 
IMfr*-*. J PEG«S3p<DffJ£f4, RfltfctttfLfc** 

ttbT*irt-5fc«>Ktt«l©**Srffl^rt>av\ JP 
EG«J|ltt*n*K:H:A<*pP,nfc«*-C«)9» AS I 
csteJB^S£J:#-e*aJ:5ftj PEGSrHSSUfca 
* J P E G 3TW4 2ffc^tfttWMf 

[0 19 7] 3^^2 248, 1, 3, 4&gijt 
i»b*6B*S: J PEG»-^t/«*-r6«MBS:«iT 

$/a £ JVC ^ ft V * CO if 6) ^-e t ® <? m-f r t # 

J4 % a^ntyf2 2 4 ft 3 fef-Y*^*/h 3/<-f h 

[0 19 8] J PEGtg$fiWfc«r*/h#^fcfMfc (M 
CU) £l*|±h,3/h3*2&*^K:$HM"t-$. Si 
T\ #*/Jv»^lsSBHittt»fi:k:«Ha$ix«. JPEG4? 
■*{b# (122) (4, ^fr^y^y ^SiufcPMW)* 

/V, St8 f^-feyVcOMCU-Ct.^V\ 

[o 1 9 9] Ei6 6f4, zv&mvt&yvyw-? 1 )* 

li, #tf^t;K46 0 li5YUVfe^t?(DY, U, V 
Sf&a»e>#5 f^tyl'fS?MUV/-'s'77 25 0 (12 



2) K&*ft$*VO^o ^fflf-ii'IJ4f4O(0f-? 
^□y^601~60 4d>&#£>MCUlMirtt:&ifc$*l 

0 1, 6 0 2 ttESff-^^$HfcYSaST?*>9 , 
5^ 6 0 3, 6 0 4f40 3©W£^>T1^1ry:/^$ 
ftfcU, VS^t?fc5e ::f, =7"ntyf2 2 4li 

* y v^Lft^K^f-^y v^-cfc9, 

T-fcSo 9 U, Vg|%7^^ 

*y ^-tss 5. 
[0200] t> 5— o(o j PEGt7'fy7*y y?\-t. 
me 7{oi%Lfc4^^^/i'-y-^f-v7 p y y?xhz> a 

:©t/fy7'!;ym, 16X8Wt;V6 10© 
tf^ir/w-r— ^^ns/^^ii^r(OY, U, VgSi-SRx 
(O) ?r#tr4S^6 1 lfc^LT^ 

5„ ^(ot^-t/UT 5 — ^6 1 o<bH6 6tmmz-*7-t- 

Zm^Xf—fJuy? 6 1 2, 6 13A«$H5. 
[0201] 168^ I2 2W J PEG^-§-'ft:§§2 4 1 

^2 4 111, j PEGn^{kt'&%-b<DMif&ftoo n 

fkittimtt. ^7 6 2 0 4r^UTVf^"fe/u*-*^f 
24 6 (I2 2) ^e>^'ns/^v*-^Sr§:ff1 _ S„ ^ay 
^f-^llMUV^y 7r 2 5 0 
Zbtetm&ttZtiZ. J PEG^WltvKoi* 

1. DCTWC*Jtt5ll»3-9--f ^»©HfT6 2 1 

2. DCTtti73<04^6 2 2 

3. Awme 2 z-cmnzftz^y-fyx^^^x 

5DCT#Sc<0SE« 

4. ffi*ft*fl:S6 2 3T'Hff$ftSDC DCTtHR 

5. A7-7^*ft»6 2 4^*ff*ft«4fflfc#*flS* 
©a*0^r«*«F-Wb. W^tt-^A^-T'U^f-e 2 5 £ 
Rb u s 6 2 6 S:^*LT^^— 2 9 (12 

2) fca&e>na. 

[0 2 0 2] J P E G«#*a®4, J P E G«F^fk»fp 
^fCLfctWT'fcSo BP*>, JPEGm^Stt, B 
u s 6 2 O^ibJEEifi^ft/c J PEG7'ny?S:A^t5 
*Hi«r&tr. Eif-^liBus 6 3 0HLTa7v 
V##fl^6 2 4K3§e>;}V f-^i5DCi^tAC7 

^dr + >-(cM$^5o *<0#, *^Hk»6 2 2{C*3V^T 

d cflWcKWJCi-ai^HkiifcjRWi-s s t "CD c^^c 

<0^4^-fb^t5jx5c MflkK, DCT^6 2 1(C*3^ 

tls Bus6 3 1^lT^f7'U7t6 2 5 1 Bu 
s 6 2 6 Sr^- fCig**-^-r ifi^fc^S, J P E 
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GfiHrfcS 2 4 ltt, J P e G*MHts©»f£*rH*&$ 

3*r<fc 5*gtfgCb u s4>?7zc~-X 6 3 

▼ ^NF-W5»6 2 4l*?-^'t'*ft!!£'f«#S, 
&EB3MC^— 2 3 0*»e>o— K*tb5. T 
— TVHf— * fit, Obu s^^7i- *IH56 3 4£?> 

6 3 4 tt3)v<9 V K*-^-f If B 2 4 7 KS«E$*L, 

[0203] DCTSB621 14 h"^ -fezl^-^ \Z.% LT 

mn^^^Witmmn^-ommt^n^, dc 

^Pb^Tfct), Tstill Image Data 
Compression Standardj (IrIJ:) 
<D'PiChBi£tlX\ / ^Z)h<D<D, D C T 6 2 1 mXT^ 
rftlDCTKiJ -CffaSi-Sjea^feSrfflV^T^*. 
ft**, DCT^&ftffrC&l^-Cfi, The Trans 
actions of the IEICE, vol. 
E71, no. 11, November 198801 
0 9 5^-^K»«§^fc|CAr a i biZXZfaX Ta 
Fast DCT — S Q Scheme for I 
magesj lCg<J< D CT^&^Mfc&fflV^ i £ t>T? 

[0 2 0 4] i^fc»6 2 2ttDCTflMR©**flSfc» 

7 1 — TOW^MaHifcO b u s-f y^7i- *g&6 3 4 

[0 2 0 5] 06 9tt, itt*-7-{IS6 2 2*J:9»«HC» 

U-CDCT*^a-/V6 2 1 lef- tZmLtz?) , DC 
T^a.— A'G 2 1 fr<b*f— P&gitfWl.itc'O-fZDC 
T^y^7^-7.6 4 0Sr«x.5 o ft^-ft^SJC^T 
it, fi-f-fbSe 2 2tt^D5/^lJ--r^yuri:ic2oWD 

7r641, 6 4 2COlO(C##3i*ttSo /^7r6 
4 1, 6 4 2HA^7 ? -*£^S'77-r6fc&tf>2o(£> 
h %m*-1t^ y 7 7 "C* So i^fc&Ski&^T, 
2 1 ^fe^fiMBC^— 9tt/<y7 
76 4 1, 6 4 2©l-o|Cft#rt$H5 0 ^y7r^7/V 

ob u s -ry^7^-^$i56 3 4 Sr^urs 

«LfcJtHkfS-CboT*J|[S6 4 3T?*J|CSn5. ^ 
OW^jtt^Sc^ft'i >?7x-^ 6 4 5 *^UTtS* 
iflHHfc»6 2 3 (06 8) t£igSi££ix3o 31 ft 



fi#&*ft-ci^5 0 j PEGffl#«yin:*v^r, ftf-fk; 
ftfcD c t«» «r*iM-s - 1 -em&Hum&ft o o 

t L"C, 8 X 8 ©7a y ^<0«©fttSfflV^„ 
[0 2 0 6] *Hb«at|gI*K:, 2oo/^>7r6 4 
1, 6 4 2«tf«6 2 3 (06 8) A>b©A^ 

flrHbllfcSI*£ft, St??*?*** >MK.*y7T 
tc#t&*ft.5„ ^^77^7/Kc45t, iS»*^bS 

ftfcflafc8*sflir<ojfi#-cv< ^7?^ 2 ohhskb^w 

£ft, DCT-^ V^7x-^640 SriMtDC T"T7* 
^3.— /U6 2 1 (0 6 8) (C&bftSo &o-C, 
WHUBM y^7i-^-^-/W6 4 5 », «*«F* 

•fiitf. DCT^f^7i- A-6 4 0 tC 
C'f^7i-^*i'a-;V6 4 5i fc/^7ri>6© 

KwVi»7/I^7 , U^t6 4 7&J8VT, 

7*— *as^ibk<&/<y7r £»f£LTi>5<D/&»Sr, * 

4 8©S!lffil©t>^T% 
4 3lt 1 6 X 8 © 2 oMfto)RIM»$r 

[0 20 7] 068 ^*3</>"C, ffWWF-WyB 6 2 3 ttgl 

(a) J PEG*-KK*itt*DCfiMBc«>^»l«F#fl2 
/«^- 

(b) J PEG*-KW5AC#S©7VI^^ 
**5, «»W#flrfB 6 2 3 He, JPEG^t- t 

fl*«F*HfcJB6 2 3Ht, tT^y y ^Ki^$nTV^ 
UWft?o *fc, jPEGWflilcassttTV^J:^ 

^jpeg Ac««©7^wy^^w^fb/af-Kai 

[0 2 0 8] ^7-r>-4f#^Si6 2 4Ht, JPEGf- 
-K-CJ4, «»»#{Ug6 2 3^f)7>uy7^fff{k 
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A7v^ h y— AflSp b u s^y?7i-^ 6 2 0A> 

i*Hia8 2 4B, f-?*tS'i'at^^, O 
b u s-f^7i^6 3 4&-fr\^XT?-tx£tl%^ 



[0 2 0 9] '^^Wf-fclCte^Tx*— y^+yi'a. 

[0 2 10] 
[*1 4] 





8MB 


0 


EHUFCOJDCJ 8>&M& EHUFCO t — ~}R*G) 256 ffi© 16 If «yf*X>l- 

@£X#?*s„ ro/^j/O^of -<"t© 126 =y&ifim^t>ti*. 


1 


i <>9 1 1* EHUFCO.DC 2 =T—^}Un 266 fl© 16 t?*hX>MJ£lMfrr.&. 

tv**©**^** tf:>K>-K+©20© 16 tlvbms&Mlt *. -© 


2 


/i>92& EHUFCO AC 2 T-TfiKJ) 266 tt© 16 tfyl*X>M.J£*irtfrf 5. 
Ty**G>*'l^vh& t 32 t?5*9-F«|> ©2o© 16 e*NHBta§tt a S. z& 


3 


/<>*3fcfc EHUFCO AC 2 T—JMb 256 00) 16 lf7hX>MJS16tfr*"*. 

T^x©«*evhj&« 32 e^^-K+©2-3<o i6 e-yh«a*ai?-ra. c© 


4 


/0£4I* EHUFSI 0C.1 %A\M± EHUFSI t— :f Jl/© 256 B©4fc?yhx:>FJ£. 
EHUFSLDC.2 t~VjW0 256 «©4e?hX>h"Jfc£feJC«Hfrr*. Z©*^/* 


5 


/^6I* EHUFSI AC 1 x-^JU© 256 «©4tf^X>hU^» EHUFSI AC.2 x 
-?/u© 266 fl©4evKi:>h'JittU*ttr<S. ZJto***)l*J*tXt«Xm 
128 5Y>A«ffl«.^*1-««i 


6 




7 


/<>$7{±ft>Hb-f--:7>a> 128 fl© 24 tf?hX>KI£«tfW*. JfJ/O* 
©-r^T© 1 28 7«f i/©**3/« h££©$. 



[021 i ] E7 otc^r, /^y-?>$m-fc&6 2 4 

I4> #-*Hfc§§6 6 0 6 1 t»2oc7?3S*eoy 

uy9b*b±\cMi&$flZ>. »*07"p y? 6 6 0, 6 
6 Hi-v/Vx7 P U'^i?-^^zt-/W6 6 2&7TL-CI3CO 
bus^y^7!-^^ft5. #^ny^fi, Jtii 

flMBtefSCT, -»jftTtt^t>6*»lo©^ny^©* 

a. *J-»flS 

J PEG*— Klc*J»t*W-f-^K*V»-CH:, a77>7 

Cf-^^a-^babtfCtfcDC^ACflyfc©* 
wt*y httt^U 'tyt** h ft 5. /< 

y*Jffil©|»k X' FF/W btf^bftfct-f 5<t, 
X* 0 htfttASftS. RSTnT-*^ 



v^t© ri J t*y bT©/<-f hiSii&at, 
ftfc/W h#X' FF{C&o7ct#<E>X' 0 0/^ Hf 
Afea*5fft>tl5„ R S TmT-*^SJi>if 0 
CC^7**^a-/HC.to-C»*$tt5. *fc. HCf 
7**^3.— /MS, Pbus-CC7l'-7'-fy?7i- 

*±© r«:&©j flr*i?©flK%K:j:*>, Hfft©*«KE 
O I * SrJf Ai-5c E O I v-# ©Jf AJ&a»-*5V n 

a, jfA^a^fiirjfca. W7j^hy-A« 
^gp^*y {c«#ii*H5 0 

[0 2 1 2] * J PEG*— K©«^-CI± % CCt^ 
(Pbus-CC^l^7*>f^7x-7>) a» 

5. ^HJ (J PEG*-KiPI^') #^5-y a 

h y -Aofg©/v h it 1 ^©tti&fflaasfffrih,*. 

b. 
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[0 2 15] J PEGffiFf-fb* M>— AfcStfLT* 

y j//n'-6 7 i ic^tx' o o#a^ k x' ff 

flfeCDf y h 1 1 '(>l^7vyv'yt^Sv'7^-6 7 2 

co^yX}^ A7-?yf-^x h !) - Aoft|0©8 f 
y \-~QT Y\si/>y&)Xtz*\ yi^lCl&^ftfcHU 
FVALt-^;W2 5 6<7?^V r- y tfryfTy-f-f 

7 6tOiSj££jV «^tW»^*tWiPf'i/ hgci 
Ify r-SrfflTJ?*— Ty*-6 7 6(C^aiU, ®^8C6 

7 3 \zmz>^y?yx h y — Acofr u^&sm4£&?ij 
i-?> 0 rd-e, f^tptry h^:i±a^ffiwM^-cfc5 0 ft 

$}<D/l<y ^ T yZ?W&^K*£ l ht£fr<>tz.W&s BP*>^ 
7-?>-i/y#JUtf8 tfy h«Ji-C*)ofc^lCfi, t- 

XT Hi^#fr3t$tu -StT5*t?> fcSvM* r^s 

[0 2 16] fc-XlCg<5<^T^yXi>,|i£(T£> 



A7J* h y-Atf>fttOtf>8 tTy bSr I NDEXKteSft 
HUFVAL (INDEX) Sr7ayf- 

If HUFVAL (INDEX) = = 0 0 x x 0 0 0 1 1 1 . . (ILL) 
exit 

elseif HUFVAL (INDEX) ==lnnn eeee eee 

e (HIT) 

nnnt'yb&eeee eeee (cH t LT$5j£ 
^y#;wftN = de c ima 1 (n n n) £rtn£ 
X* o oo^i/y^fis t LT*X 
atj^ h y -AOSS 

break 

e 1 s e/*HUFVAL (INDEX) ==01 i i i iiii iii 
i (MISS) 

HEAP I NDEX = = i i iiii iii ii^-feyh (fc — X-S-* 
£OKHK£) 

N=9(C-fey h 

If A73* hy-A<Dfg9tfy btfOX~hZ 
HEAP I NDEX^rl I##P 
f i 

VALUE=HEAP (HEAP INDEX) <D7x. y ^ (^ 9 tfy h <D 



m<Dh<DhZ$$x.Z><, nmr^=rv xahj peg*- 

KW^-Ctbf^U {SiiTVU^y X-MiJ PEG^e-K-C 
*>#J PEG^-KT't»)^-r5. 
[0 2 13] itijtJ PEG^7-7y|f7^^!) XA 
ft, /N7-7y^y7j?/WSrDC^^lfe5^(tACyyu 
y^^I«t'*.e)Wc-7y^y^t5 0 dftfi^l' J P 

e Gicj§-f ?> x o icm^ztixte *) , n^tmcn^x 

ff!|(D/N7-7yr-^ (K3, K4, K5, K6) 

^B^^TOJL^lj-tv^^^v^i 5 L 
tchV-CfoZ, /<>-K (RSTm-r-*t?E§3f5^yty 
ny^) tr^^SHC^X^^-zWOx— y" I'— r- 
li, 1 ^ n f/f ^/Ptllff 1 o©D C/AC#f -C 
#>3o HCt7* ; e^a-/l'iCCt7' ; f^-Mt 
ft, 7*-?* h y-A^bX* 0 OlfA/^ hSrfM-t 
ZtzMC. 1 y Py^lMy/l^glCftSCifcfcS 

[0 2 14] K-CftflsU ^ 

n y y iH' y /ur <k \£ 1 /N^-^y i^v^/i'SrttW-t-So 

^6 6 111 t-X(Cg-3<ffijI^-T/l^y XA£ 
«X.T:}o9, 0 7 lf;i^ttili£6 7 OfcftoTV^. 

#*Tvw-X 
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Loop 

If VALUE==0001 0000 
exit 

elseif VALUE=== 10 0 0 
eeee eeee Srffit Lt^ii 

break 
el se/*VALUE = = 01 i i i 



1111 (NL) 



eeee eeee 



1111 



l l l 



i ( 



MISS) 



i) 



N = N+ 1 Miry I- (HEAP I NDEX= i i i i i i i i 

If A** hy-ACDSgNfc'y h&O 
HE AP I NDEX£lii*P 
f i 

VALUE=HEAP (HEAP INDEX) (D7x.y<?- 
pool 



pool 

* h!) 7 111, A# J PEG 6 7 lff-f^t* M 

-A2»lbX' 0 0SA/^h« X' FF!S&^/-WK 

b l J — Att/N^Ti^vyjff/K^^-^fo^s, 

[02 17] ~>7*6 7 2;/as/^ti:l 6 tfy btHtl^ 
R©A7-?y»>y^m»6 7 3K 
(MS Bri>?>LS BCDH#©.tr y M) — AT*) 

if^fctt^ tr v b zmvr-rz a>&fcjt-r swu^tai 6 

7 3tett3*VO>$. v"?* 6 7 2te«^g&6 7 3a»P> 



EG^-KtcfcttS) 9li'V7l?/H£»<fWnb'y hftfc 

[0 2 18] &^g|56 7 3lik-/i:l^<7;l'-/!)X 
A<D3T5rH^b-C*5«?. Obus 6 7 4iS-Cf-^ 

?$'*K:SSR&*vCV'>S. a^SB6 7 3li, x--* 

5) SHI* 5. rrT\ 7iyf7 Ku*tt£lT©J:3 

[0 2 19] 7iyf7 KW* 
[0 2 2 0] 
[«1 5] 





tttt 


[3Z:ZS1 


ffifi^tCdcquantizadoti)^— 


[24:191 


Not used. 


[18:9] 




[8:0] 





4 tfy b&btttoV-y b tRSTm-7-*1tit©3 2 
try M7-K^©*S# (JPEG*— K) $rff5» Win 
fcV Mi, «HHB6 7 3*3lS/>'#A'k:*H - *f*!«Hfy 

6 7 3tt, K$rM81-3fc«>fc 1 17- 

5, «-f«mic*5i^-CHL, {Kit* *>£>-?*>) <s 



t, mi37*—- ryf>6 7 6Hgfia<0 1«^7-K (F 

i FOfc*£#fcL-o*a) & r*fcj turmwu 

1-5. 

[0 2 2 2] 17 0{Cjo(t5^7-7>^ft:S6 600 
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j]7*~ 6 8 3, ^^s/->3.^P>T^-fe^$tv5 
/Vy^TyT'f-^ItS. A;W66 8 5tiT-* 

SB6 8 1 iC^Tft^fcSa-tS. •x-^Vl'i: LTfcfc, I? 

^yjtf/Ufcft^flSt-S&fctt^YyS'a. 2 3 0^©7^ 
ft*5> J P E GjElitCjoViTii, A 
C«*fc D C«*^ ft 5 0 4 

2tc±oT4!! ! a$nT, HJ73* HJ-ASrtfy hl/^ 
■Cfll*-f5. ■>7?6 8 2U 1 iK«l#W^-f 

x-f y^mX'hpRSTmtRO I ^— A*i®£ 

3(C^iS$ix, X' 0 h-CCDjfA&a, X' FF 



[0 2 2 3] X' FF/V hC0lfA^ai4->7^-6 8 
2l££oT : ?T*5ft-5fc#\ 6 8 3« 

X' FF^ h§rMfC^A't5fcftlC > ->7^68 2iS» 

6 8 2rtJC-(ix.5r ir{Cj:oTtTt)^5o *<< h«JMc 
flsfc-f •^-^JfAtoatciioV^Tv'7^- 
6 8 2(aot^Wlt$tl5. tt-&«m«6 8 3tt^ 

-*Kift3iox" ff" /-w nsjn*icr±i*A«a«:fTt> 

So 

[0 2 2 4] A7vy^fffTOj PEGffiilCiot^ 
4fc£lM3 8otf>7 i — 2 ocoy 1 — 

[0 2 2 5] 
[*1 6] 
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[0 2 2 6] 

/N7vyf- X/vf4, ayBtytf-^^ty>'a2 
a. 2 3 OH:, #7-f^8!7- Ktf^fiScSl 2 8y^> 

■7— 7 s — X/Wi/J>^ (^2 5 6 
3fg) /JKO-C, Obus©3 2t*y h7KW7-f-;V 



[0 2 2 7] ±i£coj;3K, J PEGfftj&fc^-K-C 
li, &*^7^>-^-7Vu£|&jW^3fcfcfc:if-** 

[0 2 2 8] 
(S171 



EHUFSI 



-y-rx 



256 



Huffman 3- FtKX. S«A7?>fiF^(t 



EHUFCO 



EHUFSI DC 1 



256 



16 



jpeg jeht dc co \zmm 



EHUFCO DC 1 



16 



JPEGffiffiT DC «ft*«F<Mtr*Ofcfe« 
UH>T7?^iLTffiffl. 1W->X 



EHUFSI DC 2 



EHUFCO_DC_2 



EHUFS1_AC_1 



EHUFCO_AC_l 



16 



16 



256 



256 



jpeg mx dc ts& sra <nfci-s co \zm 



JPEG ffiJWf DC fii&£ft*Hfc-f SCOfclfcffl 

£ft£^7V>3-FiMX. 



JPEG ff.«T AC fctt £ft ^fc-T •ScOfC'Kffl 

7- 3 'J £ 7 > U £ ■i y=f v £ X i: U 
T«ffl. 



jpeg be Sir ac cfe«c«-J3f%<frscotffiffl 

TSiJH. 



tr^-tf-^^SftvfidcfrW JPEGJIfi&T AC 
«»£?J*MbT&coKfit«ffi 3 nsy\ 7T > 
3- KIM* X. ***©rt^3*Ufc?>l<' 



EHUFSI_AC_2 



EHUFCO _AC_2 
J. 1 7. 4 J 



256 



256 



Vzf-V > 7Jl^*n?fi)c^co JPEG E&T' AC 

«ft £8 ^ftT-s co tc fig/H s n s 7 v y 
— — rr 
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o~3 1024 e % i6 e<yhtf>fc-:ftRi£«*-r*, *'Mi'T?*Xi;*N=*V*/«>0 

fffflyyyt§» 

4 DC /\77>«#T-^/K» 12 e?Hia<D3%«*8t55/|«£ 512 fiftttfS. 

jt^'J/C/frD^Tg) 12BX>Am*feftS« 

5 AC /\T7>d[^T-^^«> 12 eyW«B©^*^at!vH£ 512 «€wr*. 
Al^rvf/ X*0>«<h2tfvHn*y, 32 bf^h9-K*<04O<0/<-<haB€«fil 

^S. X^rj/O/yor^T^) 12B>PfttfBl*ta*uS. 

6 DC fe&tffC AC /%7^>« ^7"— 7/WD»*4tf ft-fVT?** 
q>K»ge»HcJ:V> K»<P40<P-jJi«g|R^o 

7 M^-ftT^-^/KD 24 tf'vNSB* 128 fi*fc-f*. 'J/ O*®* -<T© 123 



[0 2 2 9] J P E Gft^tiSg 2 4 1 (02) tC&l^T 
* (PO_IDR) *>3VMi (RO_IDR) K&isOft 

[0 2 3 0] 3^nt?t2 2 4IC«fc9ffl$*t.fc1-'<T 
mbO«l6tt, A*tW*f-^© 

i#KW8lrC<fc5. CftfeoaUtt, tti#^-*£## 

ft try hSr-Y^-^/Hci-SifcT^tftS. u*» 
3B8fctt» 3/nt'yf2 2 4ttW7JX-*C9 rgijftj 

[0231] @73 srfflv^-c, nm, mm^m^mm-r 

3. A#B&6 9 Ott, $>5i«£ 6 9 1 6 9 2 

tfSLtf Ui#£-f3o La>L4*6» J PEG«F#{fc-> 



Bffc©M#8©^fcfc&*V'>»#$\ MCU6 
9 5 «r«*i-5 H'MMMMto* * t>A> i:JJW £ -ft Uft 
vW4t*. HW7lMlfcU'$***RO_cu 

© «fc 5 ftfcyifc «fc d , ftjj&WTW h 6 9 8 stftov*- 

[0 2 3 2] J PEGfffflll3lt59R MRtttttf 

0 7 4 fcjjfrf* J: 5 fc> m^-m&v 1 * MJ y 
-fl 0 1<D— S5&7 0 0SrttW3b5V^Ht»8l(i-5»&-C 
*>3. f5 2©^-^.ft, 0 7 5\Cm-TX0iC, £&<7>m 
&7 1 h 9 yT" 

7 11, 712, 713) ©ttffl&SVM^^S^St 

[0 2 3 3] J PEG#^CD^7^-— ?y 1^7 

[0 2 3 4] 
[*1 8] 
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km 


D 




M 




4 




S 




H 




C 




1 


U — [D /J VZ 7J V KJ k ** ~ 


F 





[0 2 3 5] 3. 17. 

a^ntyf 2 2 4 liH! 2 CO J P E G^-^-ftS&CO— SB£ 

fMMWlfro-CVSr fct>H*LV\, Sic, yvvv 

^mn^imhmmo^x-b 

iX&L&cR^btiZZt&XZZ. 
[0 2 3 6] 3. 17. 6 MMDCTmW: 
^©07 7£*Ut.fc 0t£ffl&=**Ml'm& (DC 
T) SSg-Ctt, *"f8X8^ny^©yiJ3ST|fi]k:»LTl 
&7cDCT£Hff U ftVNT-8 X 8 tf^-fe/^ny^O 
ff#ft.cM.ii&76DCTl-5r<h^£?>, 8X8 1^ 

ttt..A*HKl 0 9 6, JFfflllgl 10 4, Mffl®% 

1098, e&^ymssi 090, a^iag&i o 9 2 

[0 2 3 7] ATjIeI&I 0 9 6l48X87'Dy^i>? ) 8 
If S' h tr^-fc/HSrgffi ATjBSS 1 0 9 6 It, *|B] 
Y/i'fT'^tllOO^ 1 1 0 2*frLXWffi®&l 
1 0 4 *iTV^. gfl?li!g§ 110414, 8X8 

5o 1H#PI3I8 1 0 9 8 tt, i&<D£X<r>m$teM#L\ D 

^ * U 1 0 9 0 , ? 1 0 9 5 , WTJlHjfg 10 9 2 

nift^^y ttwcv^s'i^iM i oo 

KttK&tu ▼A'^V'^f-i i o oii^co-^/u^^u 

110 2f4Wi?^^ 1 0 9 4/&»f>C97*-*£'bgffi1" 
5. flUUallS 1 0 90l48X87'ny^ r-^ Sr?iM3S; 

t-atju jTj&are^-j'fcttia-fs. tB7j[E]BSio9 

[0 2 3 8] iIStf>D C T««T?t±» gffiHK 1104 

^^co^a^s^^s-rso 177 rojf^BK 1104 



pgl 1 4 4 , 1148, 1152, 1156 ^JP^^ 

JtoWB^fe*!,*. 54IMIfiaKl 1 0 4tl4, * 
-©*ijHaK^EIKl 1 0 4<D«*cr>&a©|Sg£||trt- 

*, ^#©«Bllc!fi-6l*|IllJ±#*!Hl«BR:Ri-5l*W 

[0239] 078 tt, 077 ©§6BlC*5tt 5il?it<7?g 
tlf?*— ^^Sr*Ufct>0-C*)»), DCT«r4«Uia(S 

^Sr*Lfct>roT'/<C<, »tB**Lfct>«)T?*)6. 4«L 
111^1 1 4 4, 1148, 1152, 1 1 56©^ 

ems, m-(Dm*f$.^m%®&t^xmmfrz. * 

^A-rfctc, 10:7cDCTCO4&g©PWl 1 4 4, 1 
1 4 8, 1 1 5 2, 1 1 5 6©*ft^*t#W»J*£*t 
5 0 *fc, £©l§M&fc::J3^-Ctt, 4fcMll44 > 
1148, 1152, 1 1 5 6<D%tl ; £ixtf#m<D&M 

[0 2 4 0] LfrLfttfb* Z<D®V&<DXM.ttM&tffc 
jSlCftotV&V^ bXbSo 4 *Q.gSPg 1 1 4 4 , 1 
1 4 8, 1 1 5 2, 1 1 5 6 it^tl^timm^^ 

^<Dm—f-^hm^fix^^o ^cofc*, 

<DmX\V?nyi> 1 1 4 4<D2 0 n s) „ A^iffl^ 
A^/U^-fl-l 1 4 6 t 1 1 5 4C0Sli (^-tL-etv2.n 
s) t, 7^^^70 7/1 1 5 0©II (3ns) £ 
J£1-t, ±#:©ji3S!jS2 7n s £ft5. tJ§oT, C<DD 
CT*^(4ftai2 7 n s 1?«rfWS. 
[0 24 1] ^^5W^*©DCT#jaij:<*Pe) 
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A/WDCTMi Jt^T t fcaoT &.Mt£'&M/y-'( X 
®&*?mmf l &*^1r« B7 9lt t^ir/i^-^A 
#081 1 2 6ICA;o£tU SfyhWt/Vf-^ 
?fl£&iW1-3 i 5 4 J P E G«F*fk» (B 2 ) fciiV^C 

1 2 6fclMM*!> 1118 
d>e>C9^;m, ?M"^^tl 1 2 4 iC&^T-^/V 
^7V*-»-^£;Jv Hi*?*-* #11*081 1 2 2ir 
iiibtvSc JH»Ih1J&1 1 2 2«3te*tt. 2 [US CD/-?*© 
KT«ttW>IS»l 1 2 0tCi£?,tv6 o ttf«l@P& 1 1 1 6 

[0242] iwRs^^aHftftiaojBiiaBo/^-e 

KiSSEft$*L53E»Piffe«»iS, A^HISSl 1 2 6 Kg 
f>H5o tA^I^IM 1 2 4iiUfP 

Blftl 1 1 6££oTR£3iUt A^ETCSl 1 2 6*»?j 
IMFIbI&I 1 2 2tcf l -91fijkbtiZ. IH8 0ti, gfff 
EIR1 1 2 2©*j*$r±0l$tofc^LfcBlT**>5. 7* 

IWfcaf-'f ^**Sr»fi-6 7*!7-Kiai5l 13 8<0 
^^W/U^^U^iM 1 2 4Kl:i3V v ta«$ft«. w 
TA'f-TV^iM 1 2 4 ttfffliPIilSS 1 1 1 6ldi 

floiPiBiKi 1 2 6 owfeK&i^x, mmi&i 1 4 

0^b<Omt}t^fV=f-^U^-^l 1 4 2tC*5Wr®^$ 

*t,-5 0 IBB Qm^t YMM&WBtkl 1 

2 2 (mm% \ 1 6 6fc£o-CWaHaa££iK.a) 

ctoT«is$Hfcm, ^m^t \>^tfw&**v 1 1 1 
hjwtmnmt)*), w.&**rV ill 8i£#ii**T,£ 

«»3-9--f >-K*©2|lia©^i6S8l!*&j*i*. 
[0 2 4 3] 7 *!7-K*) 4 V^ttiS!li» a f-^vatUW) 
2@H0V<*-ett:, ffl&3W<9 hATS»«fcp«*!J 1 1 
1 8*»5>K*W**i» iM 12 4£^LT 

JttfflalBl 1 2 2lCj£<btl,5 0 ^CD^T-H:, T/Vf-y 
U-^IM 1 2 4 ttA^BSS 1 1 3 6frt><D7 : — 
U g&>*y l l l setoff"** Y/is7—f*%ffi 
BSSl 1 2 2fc|EiS1-$i$fc. WftlEIKKioTlil:* 
£ft5o HflfBES 112 2 ^©v/l,f/l/^t 114 2 
tt» #HI&§1 l 4 0 *rfclMFBIKl 1 2 

2©lti)!jto£5. fflfSl 1 2 2i>>h<0&Mkifi&tbh 

fcBf^-c, mwbki 1 1 sfrbtoft^m^xmt) 

08 1 l 2 0tttt#SrJR9ii*, fiJlfc<0l*£"CtM7'f 
[0 244] Uflf BBS 1122^ •f»MlBf**tMrt1-* 



■cva. mmi&i ii6it ^-^a^e&i 1 3 

6»»6tA'^V'^1M 1 2 4 J HfflfB&l 12 2^ 

SftfSBBSl 1 2 2 0HJ**»e>©«*^^ Ww&ttWBBS 
1 1 2 0K:Bi«Jjitfl**4rjE*lcf83j%'t6ii:«s-(?* 
3. gffi08§i 1 2 2tc*jv^«fMia««r»fcftv^^ 

B#W«r€< r<t^-ct5t tt>K, *fff@Jgl 1 2 2S: 

So 

[ 0 2 4 5 ] M 8 1 ft, ^<Dajffi* s *t- 4 OCD&&© 
1158, 1160, 1162, 1164<7?fP, 2 On 
s + 10ns + 12ns+15ns = 57nsfc#»K 

@7 8<D0KJ:9 bm&bftZz. t«r*L-cv^. :o 

4 ^ n y ^ iJ-W ^/i/M>gT?S)5 <b> 
^©DCT^^AtC^^T^/h^tT^KfiS 7/4 
ns (14. 25ns) i40, 07 8f|j:DCT^D 
y?W^^2 7 n s £"*£5 3rl#ife^rifc*#5 

[0 2 4 6] ^DCTilgro^^^T^FlC^I/^TIi, 
Yukihiro Arai, Takeshi Agu 
i, Masayuki Nakaj imabti5Th 
eTransactions of the IEIC 
E, vol, E71, no. 1 1, 1 9 8 8¥ 1 1 £ CD 

^io 9 s\zmmaiMtmx m&<Ditfi<D&mD 

ff-t-S *dc T3S@*cogflfiai& 1122« 
5XAS:|][«ElKl 1 2 2+lvi^-K^iTi: LTffiR 
[0247] 3. 17. 7: 

{csijcoir^a^p y^c^v^Tfftftf-^^ h y-A 
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[0248] #*W<0»»fc36JKW"ett, T-Xft^M 
<D9 n 9t^bK 1 ^7-7^^1:^0*; 

5ssa^7^^a*«*fli^5ii:«sa*u\ rft 

14, 9J©l»«US^oy>K:i3V^, Kt>*f—9frh'*"< 
T^ISt?#5o /<W h»^3ftfcf'-*iWifc*3ft3 

t> Kh?—9wf—9*s7 Ytm-&t> A ma&-?n yy 

h^^1t^—fi<r>&.mt^—l3^7 h^n yy 
fcft, f-9i/7 h7~uy9\z.*>^Xi/7Y$ti1t\t) 

[0 2 4 9] i^**J&»&AA3 
ftfcifJHfc tr y h 7 j -}V K^»#*ElKT«m-f 5. 
«*tt©ffl*tt, (v) *A#«#03*R<0*$ 

(m) XhZ>. r^T% mlinOTt?fc5. *fc, 

h7-f-^KO*S (a) tttJ73i-5 0 "T\ 
a ti 0 SJLh0>fflrc fc 5 . tf y h 7 w -/V Kf4^ 7 

▼ i^WKbSHTk^teV^*, tC1^7vy^^ 
ft5o 8HH»©A2j«P©g$n©lfy h7^-A^KH* 
RW>ff*£JLb&e a^«T?»x HI80=»- 

K* (m) Srft^U ttOfcTy h (a) ©fiSfcfcfcfc: 
W7n^|:glt5, «|7n^ |Ji/7H (a 
+ra) «rft£U ?—9 h^ny^SrjBIb 
bT&cD^IM 9 Mcm*.X}.tl7 ! -9 ZU7 Y-f 
5. 

[0 2 5 0] «8«OjSI-?fi 4 At>ft%<D% 
^^^-^^ftT^ftl^lfy h7^-;v 

K»S*«:3f£©WFWrtK:a*i-St>©-C**Hi, RO 
M, RAM, PLA#£<0^a»fc3»^|§H6©«^W 
*fflV^rt«s-Ct5. #H«S0!|-CB:, 3HHtftt, JP 

J $>AC7>uy9xm*ttitl-rZo JPEG£I*P 

/uhlt. DCi:AC«*©«Srft«-r*<WPVfy hSr* 

Ltli, J PEG$qi|c815£$ft-CV^4 5lct>i:<DA 

^T-^ l^**«>/<fV ^0Ws0>rts*:? , *y £■t■ 
.5£^:lcJ:•^T:|*ttl$ft.5. /^-r >^ii*ttf-# f 

fc*y h^tf)^-;* fc*y hc7)#&{c4oT^£ft5o 
^yy^^q^-f^TcDtfy H^IrI- (j peg«**t? 
{41) "Cfcftff, ^f^y^lfy hi L"C*»r$ft, 
^$ft5^tft<x-*i^*^fe&Ms$ft3 0 *b 



?<7)rt^f4H#r£ft£o 

[0 2 5 1] *H©SEKWTN4, «%n^l<i:lltM 
©gsfc^CT, &tly : -9<D7*~-?yy>)mZft0 
mtl7*vyf&m*.&. tH^V y 9 \t, JPEGi£*i 

ftTV^^tfy h7-f— A'K'*, J PEG}dtJ(t5"^— 
* <£> 4 5 KflEfll $ ftfc ATvM h $ ftT v x fcv % 

[0 2 5 2] JPEGfff«2 4 1 (H2)t-4oT 

rttin tr y h j t i*ff ft 3 »r3WM$F*ft; $ ftT ^tt ^ tf 
y h 7 W —A* K , r/<7* ^ V y* 7 -< - A> Kj £ Pf If ft -5 

*j wj^ftfc #-^{tsft-cv>&v^y V7 4 

->vYtM^9 V-7'£ivtz-5im&^7-?yft J i§-it='- 
-To 

[0 2 5 3] J PEG^fb§§2 4 1 »/n7v/|« 
"PO^WH^T-^ 7 n -SrH 8 3^084 ^^^0 
08 314, J PEGf-^©^7vyItS®M&P 
*«!l*l,TV^ 0 ^ MJ y/<l 1 7 1 ri-v-*W^- (ft 
fFFXXhex, XXB#f) Sr^*L, (ft 
tFFhex) Srlf AU ^M 1 h (W^F f h e x 
<ft%0 Obex) SrfSiitfo r.ft<bl4-r-<TA7jx- 
^^/^ M|yT|$ftfc||mXJfo*3 . 3 2 t*y h 7- Kt 

Afy M4, A^Jfy h V-J*<r>%>mtt£Z<> * h 
V y/^1 1 7 1 7?}4, ^-f hKH&ftfclf y h7>f-;u 

k^, s^-r^ftJi-v'&^tikmm^toyvyxhv 
-Atomz&^xmmzftt>tiz>m\z, wf-9frb 

[0 25 4] A73X-^I4^ h U y/N 4 1 1 7 1 IC? n y 
9MZ. 1 OC*t03 2 t'y h !7— Kt UT A^3 $ 
ft 5, AT^M" H211 Sr0*>6>3^(O#^-#»tS:ig 
8 5(C7jt1- 0 ( i ) h^Jf A^-T te^-< 

(i - 1) i'booaoo^ MS* M; y/<l 1 
7 1 (OHJ73-e7ilciX7 h$ft, «^ (i) 

y^l 1 7 l*>bW*$ftfc/W h©*SM4tt, 0 8 5 
tc^$ftT^5BiJWW7}y-y*l 2 1 2KJ:oTfftft$ 
ft<5. * h y y^l 1 7 1 ICioTl^*$ft^VvM h 
{4* h y y/^c*3V^-C&i§fe-?tt)73$ft-5. a*«f 
*4 hit, ttfo-fZ>'<<< h*«*S6 (* h y 9^1 1 7 1 
*s (* h y y/<l 1 7 1 T-|^*$ft 

ft5, y"y*12 12f4, 7 i —9^79i:mLX7 ! -9u 
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i?*9 1 1 8 2^<D 1 f—9'U h<Du- F&MffltZk 
irfcK, i—1}i/7 9*mLX'?"-%l'*S7>9 1 1 8 3 

b kttit)9?*<DMl 2 1 3*S^$*VCV^. 
[0 2 5 5] HI 8 3(C*5V^r> 7°\^i^7 9 i/K^ h'>7 
^/n.^H72> 1173, 1180, 1 1 8 1 CD 
&»JI4, f—fi'&xir l l 8 2fc*-*^# l l 

3o 9~>7 9k-^~ j}*/7 9zfvy9\&, 7ui/y 
fyuytttf* Y^7 97~ny9 kfrbf&Zft. Zti 

7 9ft* h!> y'<l 17 l^f>©7*— ^Sr«iai-?.cO{r 

a-* $ tvfc^ 7 t >m t m mcv&H&k. ^trt 5 m. k & 

5„ /Kx[-->7^1 180« 118 lCDfflTJtt, 08 3 
|!^S;h»T^**$lE#JtFf-a 1 1 8 2, 11 

8 3teeS*si£$il3. 

[0 2 5 6] 13 8 6(Ct)f r -^7°l'v'7^ 1172^/f 
£*VCV^5;&\ ?—97V*/7 9 1 1 7 2(4, Xh'Jy 
/<1 1 7 l^PjCDx-^tCS 2f©fntft/ht*7 h 1 
2 5 lief** U 6 4tfyMc5*— *4rlft9W-5. 

t?, i£§gr-*(46 4 fcTy Hf©'<WW:7* 12 5 2 
-C^K^-*^** 1 1 8 2K3t&#ft-f<5lf y h& 
fc(t->7 h$ft5„ r<DE^, tfy HRMU 7*~* 1 1 8 
2 S -v-* 118 3 t^**rtfc£;h/Ctt©*8b*y h 
##SlN5*fctffcMLT^a*|#Pn5>y* 118 5 
X.£>ft5„ ^LT, /<WI/W?** 1 2 5 214, 
6 4tfyh£, 6 4i»2X lS^v^fT'l/i'tl 2 
5 4^^5?;l'f7'l'^tyn y^ 1 2 5 SfcteSH" 
5, #I*2Xlv/^y^fl 2 5 4(4, rt\s>V%/ 
7^ 1 25 2*6©1 tfy hif-^^^ 118 2 
#>&<Z>1 tfy hSrATJir-TS, 7*-* Ui>^^«f«)lfy 
b^SScCfc^irt^-^U^^bfy h£HJ7j-r 
5. M^)T*fe?)^(-(4, /<WV->7? 12 5 2 

©tfy h&titfrf-5. t^TOl*-7^f/l^?tl 2 
5 4~©MlP«#tt, SI 8 6 ft feWcH 8 7 (C*5tt5 U 
v>** 1 2 2 3 <D y 7 tfy h 0. . . 5tL 
■C* $ *VO > 5 i 5 KM*:/ n y ^ HWK 1 ft* 

V)V*/79 1 2 5 5 (C3££>*1, El 8 6 tC*£*l5 <t 5 
fc 5 tfy MM#PflHK'7 YWffi 2 <fc 9 -£x. fe*i,5 tfy h 
h 3*1,5. -*tfc©tfy M4, 5*-* u$> 
1 1 8 2K43V^"Caf— ^©a*fc±oTttffl$ii 
5fy Hfc«r*Lfcfc©-C*9, W*/>7^V3-K 
ftti<f«0t'2' hft, fe5v>(4^7 s ^^y*tfy 



(4x-y" W*9 118 2^<D^\f y h&tfSffll&£*t 
5tfy b*«TT?fcfttf0&£bfcfc©i:45. roi 
^WU->7^ 1 2 5 5 36>f»m^*4l57*— ^ 
(£(4, *-«#*-f ^/KDmt-x-^ US^** 118 2 

-9Ui>7,9 1 1 8 2©ft^(4, **tfy 

5fcft|:i/-^^^->7 h£*v, * h y y^'l 
17 1H0, 8, 16, 24, 3 2 Ify bit 1 ?— 9 V 
V*9ll 8 2lc^3*t5i:v^fcA£'lcSHES;h, 
5, 5*-* i^x* 1 1 8 2fc«*-C#5*f»t0>-HM: 
tfy h*5#ffibft^^lc(4, * b V y^<l 17 14>b 
0f—jr#1f&-1iM3&lfr4 9MOs^Xu- K$*v 
5. SUM 9MZ.H^X7- Y V y/< 117 1 frhV>7— 
^^#&Lft^4§^(C(4, x-^ 1 1 8 2 

5>fttf y h^-CftW^tfT 5 -^ 1 1 8 2<Drt^ 

(4^H3ixftv>„ 
[0 2 5 7] •?—%7'\si'79 117 3, tf* \-V79 
118 1, T—jJU'JXf 1 1 8 3(4, f-?/l/'>7 
9 1 17 2, 7 s — 9tfX Y*s7 9 1 1 8 0, 9 ^"J 
X9 11 8 2 b*tl?inffi-<D%mX'hZ>. m&l 1 7 
3, 118 1, 1 1 8 3fi<Dx— *7n— j5tt>tf»cr*L 
b<Dn&r$<D7 , -97u-i>, «Mti 1 7 2, 118 
0, 1 1 8 2M-C<D7 ? — ?7n- i:PJ— "CfeS. 1^^© 
M»fll*^*»« 1 1 8 5 ± 15 W3<omL± y Hc«e» 
tv5. Z.ilh<D^nm^\t. ■v-%7°\'^7 9 1 1 7 
3 b 1 r—97°V*y79 1 1 7 2 ©ATJT-^SSUi: , ^ 
118 3 t^— * 1^** 118 2<D^ 
^^<D«t5fcfflv^p5jt5^, <!^5/&-efc5o 08 81: 
^•f-J;5(t, ^ hy y^l 17 l^ibo^^l 2 6 1(4 
8 tfy hy-Kfc LTA*§*1, f-9\s*s*9 1 1 8 
2\Z.m*~0*f-9'^ hrtli2 tfy Vn*)^Xhft,X 
V^5. SI8 5(^Lfc^{k^feiC4^(4, W'ov 

-tf&frc&s/^ h*^-T2 tf y h^^ojycty h 

(41-C&5. * b'J y/U 1 7 ia»e»|B|l$K:a$f>ft5 4 
O(0^i/(7?g7ct'y hiStfllW^S, •*-U7'\s\/79 1 
1 7 3CDA731 2 6 2 t UTiltH$tv5„ ro45(-L 
T, •x'-*7 B V->7^-©A^(C(4, (4C«)(C!f?^$ 
tifcx-^tfy h-C-r-*©«»fc{t«1-5ffi*Sr*i- 
1255-fey h$tlfcfy ha$#fiE-f5ii:{Cft5. 
(C, r*lb(47 J -^W'i?^? 1 1 8 2«pt?-v— *^(C 

m< Ktv>\z.nwkzti>fr : f-9 tfy t<D®.m&-?-9 

LTV^5. -r—jtisi?** 118 3 4»©-v— *fitfitfy 
ht^-^i^^^^ 1 1 8 2tf(Dr-^t'y h©PW^ 
ft^5#vnc«toT, WfP^ny^l 18 5(i^f^ 
^ify t<D&m j $>tim%ft5zt#X'ZZkkbl^ '& 

Ct^T#5 a ±j£<Dii<9, 2 00T'l/->7? <cf—9 
117 2 k-*— 13 117 3),^ (f*— 9 1 
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18 0 t^-iJ 118 1) , {7—9 118 2 

118 3) \$.m-<n®mi9%-%$-z.htiX^Z> 

[0 2 5 8] 3HHB1 18 4 (08 9(Cfe*t&*VCW> 
-5) i4, 7—9Vi?*9 1 1 8 2cDg*l 6 My h£A 

5) SrttW-r*fcft©»*-g-i>«riaKa-^l»i 18 4M 
&oT, 9vy9V'f9^ 

tttt&fev*. tt-MStt, H8 9!:*tj;5t> i 6 tr y 
h h — £ * is'Jx? 118 2^?jA*L, ^7 

^yf (8tr?)0 , #jfcl-*/N7T>'#*fl:3J'ifcS' 

(4 fc'y b) , ttfaXy b (4 tf y b) 
5 «fc o ft»*£fc*iaj&^n •> ^ i UTa- Kl7-f -V* 
ftfc 4 o© P L A * 9 A /W&^r'r — ZfA'^m X. 5 w b 

[0 2 5 9] /*r-f >^fcfs> HDBU88i»aW:, ffillPttl 
1 8 5©-Ultfc5^f>fV^ey bO'&^mcm^X 
^-^ l^v?^^ 118 2 tf-C/^XW >9Vy bm^m 

/<7V >9Vy b<DU^-U%^ir a T—jJUiSxf 1 1 
8 3, 1 24 2©8*At*2/ %&U¥y b& 

ftLfc^-g-tCtt, -7—* l^*? 1 2 4 2tpWT-* t* 
y Hcft&ofc* ?/ Hc^t5f-^l'^^ 118 
2, 12 4 1 tuDir^XWy b &%&<0/<7 ■< 
®b LT«fr£ti,5„ Ifiro^f-f V^«*»rt&tt, 
/■?7*W V?\?y h&fcbgB 1 2 4 3 (C iott^T 1 Xh 
%frb*jfrit7*.y9$ftZ> 0 ?Wfito1 m '< 
X<Dt'y btflX-hZ&&K\-i, stf-Offy bXh 

BlttAtiltt, 7-9 \si?X9 1 1 8 2, 1 24 1 (HB# 
\C-?—%l/i>X9 118 3, 1 24 2) <D\H®%'ttJ&1~ 
f,->7^1 1 7 2, 1173, 1180, 1 1 8 1 Irffl 
^Tl^Dy^t^ ^;WT*feiC'>7 h $-£5 ^ b X'ttt> 

Ki:iHi--e*>s. m^f 5 >r yfrnsstw? 

icfrfetu 7—9v*J?>9 l l 8 2ic<^^s' h 

[0 2 6 0] 18 7(4, ffilWftl 1 8 5&#»K:*Lfc 
t>©T*fc5o ftimU<0* , 'iMtiL\*^ ; **9 1 2 2 3ffc 
9, 7*-*^** 1 1 8 2*P<T>&%*foVy 
LX^So 118 3$><OG%i\?y hSfctt 



ft\Z.7-9VV*9 1 1 8 2f ©brats' hl^fc^t 

*S*9 12 2 3\a&®\£tl<Z>7—9\s~/*9 1 1 8 2f 

1 1 7 2, 1 1 7 3, 1 1 8 0, 1 1 8 1, 1 1 8 6, 
118 7, tt-JHBl 184, miT7±—?y hSBl 1 8 
8^<DMflWt#©^T?ft>S. !!5H<Z>«£i6(4, ±5£©4 
l^** 1 1 8 2t^f-f y^if V b<DWi 

[0 2 6 1] 9 yy^9 1182 f^ff U^tf y b 
|(ne w_n o b) {4, 7*—* l^** 118 2 (n 
ob) qKDVLMy YWbWM 9 A'X.t&^X* V V y* 
1 1 7 l*»e»n— K^TI6*tfy (no s) tcDJP^ 
L, ^l,•!^-^'^/^^c:*JV^-C7 f — 9 W*9 l l 8 2*>6>fJ 
H^tvSfj'Ht (nor) Sr^»Lfcb©ibTf+^ 
::t, S.f--f^/H4, «-^f--r^/Wfc5V>|4 

hic{4«T©J:9»<:lt^$^S. 

[0 2 6 2] new n o b = n o b + n o s — n o r 

ZtlbMBMttXm&l 2 2 1 fc&gS&l 22 2tTl 

i^e>7 ? -^^A^$n*vN*^{c»4 (nos) asot 

*7t, <f—9V 9 J*9 1 1 8 2(C*JV>Ttf?/ 
J£0*v\ EP^f*— ^ Vl>*9*?Wy bi>mffl& 1 1 
8 5frh<D&®&&bM<ttM\fy b&b<»mj.TX'h 

V^^-l-fc (nos) {40 bt£Z> 0 ffi (n e w__n o 
b) (46 4%ffiZ_Z>Z. btfh*), 7a y9 1 2 2 4{C*5 
V-Cjg^TVS^if 5 fi*&7*. y 9 $^5. -©4 5 * 
^hy^H71 (4^it^ffi t ft 9 , *f b 
^7— 9C>v*-b*1)V££fl%1 / \ -?)V^-?V9*y\ 2 3 
314, 1 7 l*»t»o-K$ftfc^y h»Sr 

■€n{c-t5fc«)^VN?jttSo ---C, ^MJy^ll 

7 1 «ri»ih$*«flr*ttBi**Hr^ftv\ a-wui 2 

3lHOfff r/^^yj/f-^^/Uj (4-7 )Vf-fu9 
f-1 2 3 4*«#PU f*?4-y9*y M!cfc5V>f4«# 

^tlfyhS (nor) t **tTy h» 

iSf-^^^fDl^s/Ht (nob) i^±T-fe5 
t, tt^Hl 2 2 8iC*5V>tflJ»f$n5i, ^)\^7^ 
9 VI 2 3 4t#*.e>H5^7 h+^^ftbtry h^{4 
NAND^-M 2 3 0(C*5V^^n(c^$tl5o 1" 
(nor) (4^n(c^!£$^, ^-^U^^ 
(Dfs/ b<DUm\tnt>iX^\ vA4-7l'9+l 2 3 4 
C0til^(4, bi/79 1 1 8 2 b 1 1- 8 3 ©(BlflPtl *> 
ffll/^bttSe 7*— 9VXP7.9 118 2©i|S(4x5' Koy 

A \ 1 7 l^^Oft^t'j/ h^SrHX^1--5fclt«0||JliSSr 
f-?^? (c?S^1- 5 «t 5 K , *> 5 v ^ f4m-^/^- ? r 
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[0 2 6 3] &%-<M?Mz.&^xmifc£ft>2>Zy h$k 
numtmw^i 226 K*sv^*fr$^5. 

K Kfc-IHH 1184i»e»A**H5. 1 

6 try h©ff**tt«HM»fc*Jv^-c" oooo" tnw 

itZtlZitlby " o u_r e d u c e" B$?y^ 1 2 2 

svttr oooo" ioooo" \cft%ik£tis Si 

1186*118 7^©MlPflr»***.6. 
[0 2 6 4] /D-^ 1 2 2 9liEOI (MMtT) ^ 

P S"<1 1 7 1 C*JV^TW»**b*^ * h 3 y/<l 1 

7 i -efflmsttafinHrfcEo i ^-#K:ft3fcote»;:# 

ittS^ 1 2 2 9 ^?1 2 2 3|I 

t6lW£;h,-C^57*-*W>** 1182<fOiry Ufc&* 

tt, * hP y/<\ 1 7 l*»fefrU^-^ttA**ni* 
(5*— 9\s*S*9 1 1 8 2fr'&%£iv67 : —9$Wj%>0 
COtTy hSrft«PLTV^> , R9 Ify h^B'JI&^ftfc 
E o I -7-* ©t&tfv^yV :/y*fW£lM X^-fr t (' 

ft*. $ h***?* ^«*©fca* , '<? f -f' f^tTy 

[0 2 6 5] A\s}\s*/79 1 1 8 6, 1 1 8 7tffl^7 
* — ?y hgfll 1 8 8 h-rS8»J&*rU ^ 

«HWHHi±*<o J: 5 \mWffl> li85J;t)#^e.tt 
So ttflltf? 1 1 8 6\*7*-9 \s'J>*9fr 

*b 3 2 ITy h- SrAA U SW=«^£*VC^5'-«:7t VfiF 

ftTV**W*Mfc<?-^T©tt*iqtf y H*x ^Wl">7 
^1186 ©fflajc^fcHtrfctctt**-* - b tcft •? , 

/<WS79 1 18 7^<DAl3t LXT&btlZ. #*Pt* 
y 118 71*, y-Otttfj?*— <?y 

b t LTfflV^tlB19 1 fcfc^Sfl-C^S 1 1 Ify h7 
-f-^K»C*5^T, £gW>k*fg?'JKfWDfc*y M4« 

9 6(C*5^-Ctfy h8*»febry h 1 8K&$ft 

*»tt«»-e*5ri:fcS>*. Z.<D\?y hffetej PEGf 
3¥T'M^$tlT^5i5^1 1 9 6<Dtfy KOi»6>3K 
ffM^ftSo ry b<b LT^ft*7 

Wl/$/7 * 1 1 8 6 , 1 1 8 7t^W^flSr^S1-5C 



fctcftS, 

[0 2 6 6] UA7*-v 9 b^n;y^ 1 1 8 
•tv^+SflHKrfih^ JPEGSitttiWl 
1 8 5*>F>-£;LM-VSDC/AC^fc (1 1 9 6, tfy 
h0i»67) £DC{fcgfcJg^fc*y h (119 6, t*y h 
1 9) , ttM^y Vtf* \-ls79 1 1 8 7 ft 
SttADtfy h (119 6, tTy h8d^l 8) , T— # 
u*s*9 1 1 8 3*»P>^-*.e>*v5^-*ffiB^y h (1 
19 6, fy h 2 3) 4:SrH9 1^+7*— ry 

ffil I88ti, 7 a« 

/^7-7^ : K^«^^*6^)ft«•f-«lS*ti«U, its 
[0267] 3. 17. 8 m&M&tii$ 

< 2o©xj) Ttc^ttktiSo — otey-^H&cDif© 

s^fyT*. t> s—ots&mtt-v-f+y?!) y9/mm 
©a^fifoTtr^-fe/vrttcffl^iii^Sr^-rs^^y 

[0 2 6 8] 0 9 2 ft, -y — *RflW>B9JfcfW&&*a# 
{CiiJSft^^y7'7 2 0O7D-ft- bSr^UT^ 
•fjVfrl 2 lT*#E$n*o ftfe<OffiF^®a7 2 2 

tfib<Di)-yif^7 p y v^«tai:v^fc2o©jjHi^a«' 
h^^ht^s. as. ffiiwtf-^f-^'; v^i:ttS'J 

tc \?9±flom* $ * £"*ft#i£-CS[# 7 2 7$ft, g ft 
[0 2 6 9] Ht!^^©fcto©^y- Kff**H 

[0 2 7 0] 
[£19] 
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S 




offt30] 




P 




C 


y 


A 





[0 2 7 1] lW**^?^K*IS*7<f-A'K©lWJfc [0 2 7 2] 

£AT<C^-To 1^2 0] 



Operand A 



Operand B 



Operand C 



Result 



S£i?i 



kernel descriptor 



Source Image Pixels 



unused 



pixels 



[0 2 7 3] 

|4*-^|Hj*^-<75S^^'7>i~V2/ hSr^U 09 

1. y-*Pr&M*&&&7 3 0 («F»ftLOHS*, 
2 4. 2m&M) . ttlt (0, 0) *SH«©fe±. 

2. tK¥7 3 1 tmU 7 3 2 (f-^f-^-T 0 ^) 
(2<D*f$C, HJfcft, 2 4. 2 4fl?&g) 

3 . fca©H£«fr?ijff » , fr0v-w + y ^ ^ ©ft 

HSr^-f 3 bObp7-f-^K7 3 3 



other 



pixies 



[U 2 7 4] flJlB-fo^' ^WWWUfl lw fcicM-^ft 
tRjlcife^bHSo HI9 6(c*5^T, ^5VKBJ4y- 

0 9 6^$^TV>5 J: 5tC N f<7yKB7 4 0iW 
Vfy9*'r—-Jfl'l 4 1 &«U -iv 1 ?*;*?-?--? 

fi7 4 2) ^5*iTe*5. -Kfc. >f^fy 

[0 2 7 5] t^5V KCtt*¥/£*if7'?-l'7Vi'W' 

ft, 0 9 7 t^l" £ 5 (CH^^^Wx-^ 7- Kfc 
*?S-fc$*VC<^*. &3k¥7±/l>P [N] W^-y^/VN 

[0 2 7 6] 
[ft 4] 
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prnl -= (l.offsein] ■ mcfp^r'-OOOO) + J 

[0277] rtBB&Riii, m^mt^^^^t<D3 ' 

[0278] 3. 17. 9 Sii^^- 



J w rjC -s(x+rAx f y + cAy)[n} 

-^/uasfcttta *f ? ir^r t fc * -*/Hi fc'tt » $ 

u *a*«at»i445-a*if^-feA'ri:{ci 

[0 2 7 9] y-^BfttfftS (x, y) £WU nx 
m*&*#— ■♦A'tffcC (x, y),«r*-f5t, StC 
OI^H [n] On#g */W4, 

[0 2 8 0] 

HS5] 



[0 2 8 1] fe*V5. -Cl-C, ie [0, c] , 

j€ [0, r] "CfeSo *7ty H©S«, *Pfl*£* 

7 5 1 Icjgffl L> ^*W^7 5 2 Sr4«i-5WSr^Lfc 

«4, HfcSHfcl&^rattfcmrflS. #^7VKt 



[0 2 8 2] K 
[0 2 8 3] 
[S21] 







s 




c 




p 




A 




cfft3:0] 





[0284] 3. 17. 10 -ftnWg- 

[0 2 8 5] 
[f&6] 



T©7d— 7j/ K*W1"?>o 

[0 2 8 7] 
[«22l 



V 

























"OA 
&2.1 ^2.3 
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Operand A 




mm 


'J— A 


Operand B 








Operand C 








Result 


ass 


a* 


■J-A, Ay?2 



[0 2 8 8] 

Wf^tiit)!:, £(TO«tWt-t^3-K [0 2 8 9] 

7 -r -/w F*r*f [* 2 3 ] 





SeW 


c 




p 




A 





[0290] 3. 17. 11 h-yft 

J: 5 4 h 5 V * h - 1 \c 1 K T V< y * * tifc 
«fc 54^y^ try I- (2V^-7 h— >cd4§-£) *> 



*V^tt«F* (2tti7jW</W£JUbCDj§£-) 
[02 9 1] W73^-7 h-VffittSHTOSfcfflV^TW- 

(PX (1-1) +d) /255 

pttfc^-fevHI (0^p^2 5 5) » 1«U"< 
A* (2S1S2 5 5) , dHC^-flPfTJMI (0^d^ 
2 5 4) *^?^W#ttjrF<Oifi •>•?**. 

♦^"^fFfcttJftP-F 

[0 2 9 2] 

[*24] 



a 24: **^3-F*J:tfyU , ^h'7-F 



— ===== 


I" 1 1 " " i - "" 






Operand A 






-a 






Ay£2*lfcA-f 
h Ay?£ftTH 


Ay^SnfcJ* hU 
—A 


Operand B 


dither matrix co- 
efficients 


B&, Ay**nfc 
TUfcWH f- | 


Ayi»<*nfcXMJ 

-a, Ay^snr 

Hfc^A-f h 


Operand C 








Result 


A- 7 1 — >3- 
H 


bx. Ay^cYtifc 

A-f K Ay? 3*1 


Ay£S*lfcXh»J 
-A, Ayy-$ftT 



[0 2 9 3] <W*7-K«F-»-Ctt. ^a-K 
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3. 17. 12 »JiMH*7*— 

^tlSHS, lot, J PEGSl-ii-a^^-T^I- 
[0 2 9 4] H 1 0 2 Tk^MMfea^-r. Htfl* 
6 lliA^)^ h y-A6 7 2<D2{g<Or r -* 
£fct>, ^CDr-^ffi7 6 3tt^S$tX-TV^7 6 
4 0 111 0 3\*4fe<D7kW$m&ftomX'h5. PtJllft 

i0t7 -feyl^JSr 2 f£& 5 V >f* 4 f&tcSE^T y ^ V 
^1"5 0 dto^-rs/ :/-?(*, 1 t'f±;um { *^-y>' 
HA, «KDJiJ^*^7^KBift5. 
[0 2 95] SE*M!©#&fc:W:2fiff, 4{£t*>P><D*§ 



y-A7 7 0, 7 7 l?SrfflV^2ffirtiHI<Dm^^b!J- 
A7 7 2t 4ff fflrawtt)*^ h 'J -A 7 7 3 fc^J^fS 

[0296] ssta^teiit 2o©7-^ h y-A 
ka) tt^-^tf^ h y-A-c*>9. ^-^hy-A 

(^7VKB) ii^tt^ h y-A-Cfc5. 1210 5 

ic, mnM'£t9mzm^itm&<D2<><DAXx h y-A 

7 8 0, 7 8 1 <fc#j£-f Stitfj.* h y-A7 8 2 &7jk 

1% 

[0297] Ell 0 6tt, MfJlWiiUfe^*— 

[0 2 9 8] 
[£25] 



31 25: dl^7-K:T^-i— **<3-+*7^-A-K 



1 7*-/H* 




f R 


0= mm 


I V 


0= 

i = mmmm 


F 




C 


0 = t£%am&!? ? >7* LfcH 

1 = &&<Dfa&i' 9 >7*T3: underflow to 0x00, overflow to i 
QxFF _J 



[0299] 3. 17. 13 ^alf-^ 

So 

a. emBT^&tMM* 

/fy^7i-^ ? f 2 5 2, t*0"fe/Utf-— JJI-of-f 
24 6, J P EGffW2 4 1 , fe**-^ f 2 
4 9, tiitl4 ^^5^3.— /^^i^nT'Dt 

^yf-2 2 4ft<DS# ©f- *7p— /-^fcffiv^. ^CD 
*S\ J P E Gft^fc^a-^ttfWfcftSfcfT*? 
TicEgjiSo 

[0 3 0 0] r-^^»f^©te0^t LTtt^Tco 
• t7*/q Hit (if v K 2 tr«> hffi, 4 tfs> htt) © 



*^y*yr 



b. P-^/VDMA^ 1 

y 2 3 e t«jaw yf7i-^ 237 nw-^is 

[0 3 0 1] = fc-Rfrm, ^7 y KAIta 

(4, ^yVKBlcioTA^-cOx-^SI^Sjff^ 
3. 17. 14 yu—mW&'Ti 



<0$*?*J!e«WifcSrft!j«l1-5fc»w*^-C*>5 0 7n 
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#|£lo(D{fc®T Kl^^bflfc^T KU^^<D^lftS-"5r 



[0302]E]107(C V 7 n—mm^<r>^^7p 

So 

K— ^ -t— K7 ^ — yu K 
[0 3 0 3] 
. [*2 6] 



type 



N 



O 



I 



[0 3 0 4] ^^^WW^V^HfT^ 



00= i>*Vj 
01 = CX< h 



0= 

1= ^fl:-^>^ 



0= ^^HBfc&ttl^X^ttT^efflU * 
1= ^**SBi*<B*#£>fey h 



0= fi«UTt>te^<Sr1*)W!«7T&C>£8o 



-*7*a-KcDS tfy h^Oic-fey ^7 

iLTfflv>5 0 *^7V KBlEiE^Offitei'S^^fDT 
Ki"*4rflfl6U y K B 7 - K©tt^s i/ i^^ ^ 

[0305] (((register value x 

or Operand B) andOperand 
C) =0x00000000) 

5, 

3. 1 8 T^-feyu— $13— KOT^a- ^ 
[0306] 3. 18. 1 fc*^-fey^— #-7Mlf 

-r^ortw^^y n^Mrt$*i/6*\ fcsv^iMuv/^y 

MJ-A££t-*/<*2 4 2*><5WiJ PEG?ffft 
*2 4 1 ^^-feyu^—^^if^ij^- K» 

ii^cDCBu s^^7i-XiaotI^f5;H5 
TtSo f^-feyU^--^^ -f 2 4 6liPO_CFGfrJ 
H^$>**©ffij£1-5 «fc 5*3Eoro*— K© 5 *>©— o 



5. 

(a) T>f K/V*- K : f #7->f if 2 4 6 3* 

(b) V— ^S^yl^e— K : At)*T~9 liftgPF I F 
6 fix- # <D 3 2 t* y h T K L"CA7J-f V* 

(c) fc£IK3E&*-K: tr^-tr/ud — ^T-^if^fe^ 
MUV^5/7 7 2 5 0\Z.&Mi£hX^Z'( y?—'</v>& 

(d) JPEGffJN-K: l?****— iS^-f2A 
6 tm®.*?—* £M C UWfS;tMU V/<y7r icftiM 

(e) ^*Sd*#*tfH4MEifc*- K : tf^-feyv^- 
af-jy -T 2 4 6 ^ff^SrMU V/< y 7 r 2 5 0 \Z» 

#iU ^S^fefttfix-^* 2 4 2 fcfc-tft&e*. 

[0 3 0 7] tf^-fey^— iJi-^f 2 4 61*^—^^ 
^2 4 2iJPEG«f02 4 

fcftl;MUV/-!?77 2 5 0<B£5„ 6SM«*K:*jv^ 
T, 'l'>'^-/W*5j;tyt^7 1 -7*7Wf4MUV RAM 
2 5 0KJ:ot»iJn, SBlfyb©^-^ (4OC0 
X (4 tfy h(D-ry^-^yW!tt 8 t* 

Sii^SI^Ofc*(C % MU V R AM 2 5 0 tttr?!ltSj(fC 

fx x i Qmrnrnzft, 2 o tfy h-e 

fc5„ MUVRAM2 5 0liHo^^t^^^fc!) 
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*aWIHim\>l&'-9 *x 242 icefcftttfttfftfc 

[0 3 0 8] J PEGffijgfcl&Vv-c, Vfiz/l'*— XT' 
-f1f2 4 6f4, MUV/^7r2 5 04:ffioTMCU!; 
^;U/<y7 7 t5. J PEGffi«©ttflllRLh©fc«>fc: 
14, ^^y7rS»Sr«[5-i:* j a*Lv\ MUV 
RAM2 5 OroiWA^^^^i-X^yf 
2 5 2^f>c07 f -^$r^oT##5i*tv5„ t> 5 

J PEGffffk#2 4 UcaiS^tf- 

[0 3 0 9] t'^-fey^— 2 4 6{4, IH3 2(C 

/V^-^^lf2 4 6^itJ^^|ft^1-5 0 t'^ir/W 
^r—ifi-^iF 2 4 6I4, CB u s<4 ^7i- ^SiJWSB 

8 o lt^nwsito^^tj' howsncf 

l^fPSl fC*5 «?, CBu S'f y^7i- *$iJ$?Sl5 8 0 1 
fi^n-/NV>C Bus Lt 2 3 5 

«$ftT^S. tf^-tA-*— 2f-fvf1f 2 4 eictt^? 

^7i-^^5'f2 5 2A^Stt5, s t^yK 

PO_S A I DU^^lCio-Cffi^Stl/S, PO_S 
AIDl^**!*, PO.DMR^^OLfyHl 

'5 0 JI&T KU'^jKW >?ttPO CD P i^-X^lcJfe 

h«$«lt*iP$*t4. ^-^iPMUV RA 
M2 5 0{C7ay^$^X-5 f-^OSftt?*^ 
M4 P L _MU V 1/ v 5 ^. ^ (C J; o T ffiS $ ix 5 MU V 
RAM 2 5 0co-<-^T K^titteSftS. 
[0 3 10] ^5yK7x5/fS|i8 0 2CioT7i 

F I F08 0 3*SfflV?)H5. T r -?gW£B8 0 
4 14, 03 2 K*JV>TRW L-fcJ: 5 *«* ft«fP*Hff 

5lce^e>H-5c MUV7 Kl/^4dtS8 0 5iii«l/ 
-^^tfotf- *£MUV RAM2 5 0, ±f- 
^X2 4 2> J PEG«^t»2 4 1 W i* *> b tWC^k 



■*~%*f4 if 2 4 6<oif;/^i^-A'±T<£>;fcit>K£>g 

[0311] 1109 fcjSV^T, El 1 0 8 (K-X^y V K 
7!yfS8 0 2 «: J: 0 f<7^K7ij/ 
f-gi5 8 0 2 Klf4, ^/^7 Kl/^4W (I AG) 8 

1 O^StvCSo"?, ^7VFf-^7xyft± 

w»»8 1 1 taie>*tds» n*fta«8 1 1 »r 

£$«58 1 0«S*tMUVr KW^^$C8 0 5©I 
* (0108) £<BM&tt«Lt:fc0, **>©5#*:A 
7 1 ) (MAG) ^V?7x-^^'l'5'f 2 5 2Cl5i5 

i:uv^. »*#*S58 1 itt»*Srft5fc«>o«« 
Wft«r^//-Cl^. :ft|J, F I FO*7^ hSP8 1 4 
•SrfflwCF i FOflDiftJRSr^^ U ft©K*'SrV^o9* 

8 1 2 14 I AG 8 1 (XDtffflZgrt&'O. 

x—XXj 5 2#!J ^-yfStt^^V KIC*5 

^yKf-iS'i HlCF I FOtC*&iW$tL5. MAG 

s*t i AGmmmmicmMLittZs r*#*«8 

1 1 (4MAG®#£ I AGi?*J; UtfifeU-CftBI-f *. 
[0 3 12] HI 0 8 KlioVvr, MUV7 FV*£$3B 
8 0 5l4g&-5V^ ofr<D=e— Kffif^-f 5. w*V?><Z> 
*-K£*5^-C, SlIiJPEG OEffi) *-KT* 
5. rro^-K-Cte, J PEGffiff<Ofc#<DA7J'r- : ? 
#^-**fl^8 0 4KJ:oT(tWBS*U MUV/^5-7 
7 2 5 0liy7/W<5'77 t LttbtlS. MUV R 
AM 2 5 07K U"*&J*« 8 0 5(t 8 0 

UV^y77©7K^S > «t5, MAG8 0 5I4, 
«M £ life tf* * /HI » # 5 -fiSc^x - S:© 0 tb-ffc 
»Wl8*ttJLTKu^Sr^-r-5i:tfc(c, JPEGff 
»ffl©8X8^n-^SrJgfiR-T5J:5K:»fft* S. MA 
G8 0 5S, MCU^Hfkt— fflfiftoTV^»&tft 

5» Hii on, MAGSOSW^/^y^iff© 

[0 3 13] #iiObf^-fe^7*-^|C*JV^s MAG 8 
0 5(4, 4 OO 8 f h R AMOMU V RAM2 5 0 

5„ ISIi:*9- , ? i -t*/w*»e>v*-^SrPll$K:lR9lUi-fc 

MCU7-^tel:^^->7 h$*LT*»&MU 
V RAM2 5 Ofc&jfflSHS. r-^OtlCv'? h$ 

otftjesns. M^ii, aiiitt, f-T'f-vyy v 
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RAM 2 5 Ort-CEBS^i^-^flfiiSr^-t-. 3f-y 

*/K3U44f--Y*/i"('>'y" y — yj PEG*"- KKfcV* 

PEGffiHS^E— KtCfcVT, MAG 8 0 5 (01 0 8) 
tt» J PEGff^k»©*ittlB©fc»IC3 2 tfy hy* 
-^MUV RAM2 5 0\zM&m^iXi>M^-f^V 

t;v©tt% MUV RAM2 5 0t»i$il5f 1*3 

jsitfis 3 # g co^-^MSf-^-y- y >y&htZ7 t 
-fZkixmtbbtiz,, 01 i 2ti, ■r/vf-f--Y^/V' 
•y-y-y-^yy y^*- Ki:fcit5Mcuf-?»- 

UV RAMt)>tbBfrtii£titci¥-—?- J r*^-y? : f- 

1 3(C^$tvTV^5, 
[0 3 14] tta^Pt^lCtotAAMCUiJSM 
UV RAM(rfett$tvTV>5Ffl, SS^-ttiLya-fe^fi 
MUV RAM^f)8 X 8^n $rgg^ttli- 0 — 

^W-r- tiCioT, EoO#ffoMAG8 0 5JCfc 

^fc&vvt, JMrtStiSf-^ttBl 1 1K*+J:5K 

ab7*n-fe^(4MUV RAMA^f- ^Sr^WTLft** 
P>@K^ffl-r. 01 1 4(4, r©J;5*^o"fe^©-W4r 
^1% 01 l 4lcf4, W-o<n=f-^*j\s ; f'-f l ictdtfZ>B 

^WLi^V^^tl/t*? 1 ?, MUV RAM250O 

[0 3 1 5] feSJft*— KIC*5V^T, MUV RAM 2 
5 0 Vy--/VW*5 J:t5^»ffiSr«tffl-f 5* + y 5/» 
iUfV^tl, MAG8 0 5tt^ty>^©M« 
«Pt LTiK J:5lCftoTV^5. MUV RAM250 

ii3ocD*7-^^^/HI^r :3 e^y>3.i"'5<, 

9=-**/M42 5 6*f<D4 try h^^-^/l* 
J:V5Hktt«r#rs. DMUSriiCfc#if^'tA'W*fc 
fc^T, MUV RAM2 5 035>ibffi[iEfit&#5fc»i>K: 
MAG 8 0 5astttah,5. wOffi*S#5>tu*V^f t» 
MAG 8 0 5«:^tfCV^5^^^-^*5j;U?^*ffiSr 



[0 3 16] B««8W3j:t«E&#»*©fc«>fc, MU 
V RAM2 5 0ttMDP«)ff5!HRjftS:EttLrv^. 
MAGI4MU V RAM2 5 0\Z.&M£tlX^Z>£.X<D 

i&fetCljofcoT, MAG 8 0 514^7^ K7*y^« 

(0 9 4) fc/<^h8*©*iff?ij«»i:4r 

[0 3 17] 011 Ste&VT, 0 1 0 8 (DMUV7 K 
\s*£&M (MAG) 8 0 5&i«9§£j|ffltC*-fo MAG 
8 0 5 li I B u s ^m^Mit-fZ I B u s gjfc^-v^ 
-^8 2 0 ftd^TiS 9, I B u s g#«:Iiift£&»J#l 
U ( I TX) 8 2 1 t&£ffl£& (CSC) IKWA8 2 

■fe/v^— if 2 4 e »B«3e^, fesiwfc*© if *> 

i o<o*- VXWifti-Z X b l;4otv^fc*, ffl 
5„ IBusS**^a-/l'8 2 0lt ^7^K7i 

[0 3 18] J PEGKW8 2 414, JPEGMU 

h-wm&t j PEG^aua^tvNp 2o©:i£i®a 

«Sr#^.-C*5«3, J PEG^-Kfcfcv^TffitoJxS. if 

SSvs^J'tfliv^st. b\zi.oXi3]LWz.mm&%LZ<, 
J PEGEMWiftK&^X, DMU(4MCU7 s -^SrHi 
#LMUV RAMfcUMfr*-*. JPEG#*&*«HIP 

S5f4*¥^r w v^t tf^-feyv-y-y-y-y^y >?<Dmm 

So 

[0 3 19] J PEG»*ii*«M»»tt, y-^H4fc»C 
*J»t*DCU*sJ:VDMUttJ*tf^-fe/uogWEttllSr h 

CU^MUV RAM2 5 0{r##ii*tvfct#{C, J 

fcf4y-fey h-fSr i(CioT N MCU*spS«(©*3iy 
5 V ^f4Pi^©ft^g^ y *»S:*i-. 
J P E GR^W L$!l#S|5f4, *WEU^^©rt*{!:a;<5 

[0 3 2 0] J PEGSt&^fMWtottDMUtftfj?*- 
^fthyy^^^L, DMUfflAf-?§MUV RA 

M2 5 o\cmti-rzo m&wmmt, u&xf-ty 
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itWMCUASMUV RAM2 5 0 
tltcbZlZ, tfnESlJPgPttMCUlfffi^ J PEG-RW 
- I PCfv^Clgf J PEGK^fiJLffl 

[0 3 2 1] r(D0JPSi5li, S&COMCUaSMUV R 
AM 2 5 OiC## S LEEPtIlcA(3S 

&<D^^S^7i-5*-C-€-^Ji(C^5 0 JPEGgl^ 
ffiU!l»SP(4, MUV RAM2 5 0\0&®\iStiX^Z> 
MCUA>f,8 x %-7Tiy?%Wi&\tite Tfr^^&v 
f^-fe/W;::fc-^Tf4, ifi-iJfP^/55MCU?r^lH]lc:iioTic 
^tt}-f J^lC&oT*^, MUV RAMCMJtvT 

[0 3 2 2] iCO$IJ#pg|5(4J PEG-RW- I PCIC«t 
#/5^^tti-f-5„ Sit^x-f Vi/(4MUV RAM 2 5 

X-^i^^t,? yffzmz-aiu U~-*;v~. y y* 

MDP2 4 2K£;i3, ^LT, po. lenl/^? 
ifefcit^&^ift^lc&v^T, PO-2 4 6(::<fc.5:£TCD 

[0 3 2 3] #-*A"~» y 7 =: y ^ $ ft* f&l 

mm&om® b tr v v is, 7 * y ^-t-^s $ a g ©firw 

DP{ce^?>il5*S» ^IJ^ttMDPlC'gx.fcttSBfl 

KIM y&mZiiZo Vf-teA^y-ris-?? 8 2 5 (4, 
^ * £ix-o ^ v % 1 1 K , zr^szf 

m&i'VxfitmwiZftx^z tz\c, vf-v-yy 

7 (4A#r-* KM L-C¥$«rttSa»Xf*ffl5l t fcff 5 
[0 3 2 4] MUV^fi-ft^v^ — /W8 2 6145I&7? 

Tj7x*hz>fflfflm>hMvvm%-mi,&&vm$ a* 

rt^fi-ftlflf*, MUV RAM2 5 0& 
«5 LT» »*WUTK^a*Sr 

3B£. MUV RAMSt &<^7 KW*J4MU V#»{t; 
^3.— /K£>8 hWv f ^^iC^$a-CV>5„ MU 
V RAM2 5 0 £ffll^5*J|SPSBf4#;tf>MUV RAM 

[0 3 2 5] MUVfa7^t^i/a-yP8 2 7 fife 

§i£ f * ±;umti<D^ v 9 -/ J: t«HRf#MUV 
RAM2 5 oir^Tfijjs-etsa^&^-t-So -o 



^•fe^*^3u-;U8 2 7(4M*15T I B u s 

-fc'^^ttSt, MUVt^7^t^*i; a -/l/8 2 7 

[0 3 2 6] I^^^a-zUS 2 9(4, ft%\?>r±;VU 

/u#^©A*l7— KSrft^LTV^SHk A# 
* HJ — AfifftihSftS «fc 5l-ftSo PBus^y^7 
i-^i?a-;U8 3 0(4, tf^ if 2 4 

6^if-^<7 2 4 2*3 < fctfJ 4 1 

5. MAG«|flWB8 3 lfia*<D^*^a — 

Sr^*i"<5. ft*J» MAGM8 3 1(4, 

*2 4 2*5j;U:j P E Gf?-iHfc2g 2 4 1^P>(73A^PB 

[0 3 2 7] 3. 18. 2 MUV/<y7r 
0 2 IC*3V>T(4, i*l*-C<ORW*»feW bfrtt * 5 tCfcf 
^ir/VsJ-— tf^-ff^ 4 6liMUV^7 7r 2 5 0 iffi 
SJStfcfclfc*. ff3y7^al/-5/ 3 ynJWiMUV 
/^7r250 (4*Mvv- 5/ ^ r S' T'f-^yu*- K (* 
-K0) , ^y^T^/T-^-K (^-K 

1) , jPEG^e-K (%«-K2) *-&tf«***oa* 

T (- 5 * -f ^W^- ^-^-^^ ^ b $ *L 
5= 0!lx.(f, ^s/7r(c*&^$^TV^6-r-^!7-K N 

<f—fi*zr*J=.9 Hl*4*f--fX«r»-a. 5(cS = ^ 
S. 

[0 3 2 8] gfrS^TW-^Sf UUfc9Xtf 
-^T^i* h(4L(4L«, ftiWSitSIIlUcff^ftS 

±(C^Rg $ ixfc ^ * y * v> * - A'WfllfifcR JBt iotft 

[0 32 9] 111 6f4S=»^7^^3.U— >a VpTH 
4MUV/<y77 2 5 0SrHSS1-5fc»KfflV^tLS3 

>'Bl|g^MUV/^S'7 7 2 5 0(1x^3 
— y 1 2 9 0, * h I/-^f^Xl 2 9 3, "r=i — tf 
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12 9 1, T Ku*R;«&;fc ' n— r-i-'a Vffi^4 
Sgl 2 9 2*»6>4'S. AAf*— V y-Al 2 9 5fc 

*.i> Mlxya-Zi 2 9 0 fcj;*)^^— fid&Wh 
fttPx-** hU-Al 2 9 efclBlISftS. # 

■SHk^ftfcx— fir?*?*? htix h u-s^/M * l 

2 9 3 $*V5 0 

[0 3 3 01 »«$^f-?t^x^ hSrtHHtf 
M-Al 29 7iCfc9* M'-v^^^feElUfcB 

stta. n^-itf-fmti^ h y-A 1 2 9 7±<o&& 

{£2ti1t J f—9tt J fa—yi 2 9 1 fcJzo-oawts&h. 

U-A 1 2 9 8±(d^tv5„ 

[0 3 3 1] .X h U-— Vf/MT* 1 2 9 3^CO#tii^ 
T 1 0 3 5 liMAG 8 0 5 (0108) (C «fc t) 

£^*t3. ^t&^T Kl^X 1 2 9 9, 1 3 00, 1 3 

0 1t»MAG8 0 5 (El 10 8) tCioT-^^fo 
*L, T K^K*5i* • n-x-v-a ffflNHS&Sl 2 
9 2C±oT^ H/-^/V^l 2 9 3(-^E$n 

5. T Ku^K*i&* • fffiMW8±»i 2 

9 2J4*fc, A7J • ffiTjn — T— is a yff-f-1 3 0 3, 

1 304*iW, r = -^Wtil;»itt« 
i"5. ##&#-W$ff * 1 3 06il 3 0 7 ttW»y- 
X X.£>*t 5 „ 780 1 (01 08) (C 
iot^if.JlSftl^ Kft^-1 3 0 2«3iV3-y 

1 2 9 0, f3-^l 291,7 Kt"*tt*&*< n- 
7">3 >ff^±§gl 2 9 2, X h L'-J't^XI 

2 9 3 fc:S8R$iiS. ^y^D^y hff ^ 1 3 0 8 \iT 

^^fc-O^ y pO-M-, JPEG*-K(*-K 

2) -cfeja^MtSifca***. 

[0 3 3 2] ffa^^aU-'/s ^tlftMUV/* 
y 7 7 2 5 0 #1£*BA' yVTy T^-TVl**- Y (*e— 
K0) -CfcSffr&fctt, *Jt«JC/<y7r 2 5 0l4tPt 

[0 3 3 3] If =i V7^a U— iXs >"5Tig&MU V/* 
•y 7 7 2 5 0 a^fi/V- y$Ty 77 1 — TVl^— K (*— 
Kl) Tijf^tf©^, /^77 2 5 Oil* M<— 
f^l 2 9 3t£&ifrt£;ftrO>5SA3o<D&^r-7> 

yt7-y7°7— 7>*— Y<n* V Vf'U X 1 2 9 
SicftlASJxa, x-TVHiATJx-* ^ h y -A l 2 

3. 3^ h^-WPfttt* h I'-S^V^* 12 9 3 



[0 3 3 4] Sffcft* J PEGBE«g$ttT^5t#, ®& 

^tv-?tiWMCU»i#^:<D8 x scot'd 

*V5<> #»08X8^ny^ttMCUA»?>ttta$iX'5. 

7t7-KDCT (FDCT) , fi^-ffc, xyFnfc- 
JSF*fls3*t5. J PEGJE*©»£fctt, «F-JHfcS*ifc 
T-^fir-** h y-Atf^^-^VS^Wc^ii 

T-ft, itDCT (IDCT) iSfTttftS. IDCTj&I 
CDttj7Jli8 X8tf>7*d #S08X87*Dy 
^ttMCUSrS*^1-5<t JKHe-a-*^*. JPEGJE 
«Srffl^5t#, 8X87"d^ ttlWJfcOKHfc 

##1-3. S3y7^^l/-i ya^prtg^MUV/^-y 
77 2 5 0(4MCU£#$:tf>8 X8 7*Oy?l;^)fLfc 
9, »08X8yo-y;?£MCUKW#J&b;t91-5 

[0 3 3 5] ff3V7^al'- XoIlg/iMUV/^ 
777 2 5 0*5 J PEG*— K^a^froTV^it 
^7 77 2 5 0^<DA7JX— h y— A 1 2 9 5{4 
J PEGJ£^S^oTV^5fc"^-fe-/uj55V^fi J PE 
G JBg&JlSrfTo TV ^ 5 l|i.-cD a ^jK-^ V h Sr^/uT? 
V^S. /<y7r 2 5 QCDMttf—?* h y-A« J PE 
G|*SM©¥-ft^/V7-3'7'n y^feSV^IiJ P 

EGJEBfficOfll-CB:, A^Jtf^-fe^WiY, U, V, 004 

'<y7 7^&ffi£titzm^V*/i'<Dt°?±jUfrt 3 tj:&y ! 

3 V7^ dfa U-— ix 3 >ortl'feMUV/'<7 7 7' 2 5 0^ 
J PEGBIf/B© J PEG*-K (*-K2) -C^SSr 
#fcCDW4*/h = -K (MCU) 

^/K7) =i ^ h •r - ^ 7'n y ^ Sr^* y 7 r t «Htt 

Sitfc^t^iiOMCU^bJibfflTf 5. 0ilx.fi, s<y 

77 1 2 8 9*5J PEG#**fr5fcfcfc J PEG*- 
K (*-K2) T-fc^raii, ttiTJT-** hy-Afi, 
Y, U, V, O ©ft:*: 4 o<D 3 HS:^ottl7j 
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[0 3 3 6] El 1 7te01 1 6(D3i^3— ^1 2 9 0 
©H»B|-e*>5. f^t^ny^Cftfffltli, A;0 

<?—9ip7*?=.>r hWf^li^ h U'— V'f'Hx i 2 

Ufff-fl^ftS (Ell 2 9) . P-x-S-'a VcO*#§ 
liA#o~ r-Va vWilf 1 3 0 3 tc<fc ^);fc;£$;f^ 
fctftts 3 2tfy h(04A#lffl;ft©'T/l'f-:/l«'*iJ- 
1 3 2 0fcJ;yi 3 2 5^ 4 0C0 5*>COlOCDpjBg/ t C 

*.ff, tLWt/W40©/Mh^ (3, 2, 1, 
0) ©J:5K?^A#f*»*&ftT^fci:1-5£, rcot" 
n^oti- r— ^3 ^1* (3, 2, 1, 0) (0, 
3, 2, 1) (1, 0, 3, 2) (2, 1, 0, 3) t 

C01 2 9 OtCtftfj^tl/S, 

[0 3 3 7] ^y7r* s J PEG^t-K K2) fit 

^©*-K» n.-/Vy ?T ?yf-//W-K 

(*-K0) , #1/V5»^7y^f-^- 

1 3 23IJ-efc5„ 2AAllii^'7;Vf7'V^tl3 

2 1 |±M#pft*l 3 2 6 5rA^n-f->3y*J«ff 
1 3 0 3 i/-^U-^a Hll 3 2 3<£>3#?£-f5 

r. tfc j:o-c4j*1"4. gL&o&a*- k i 3 0 2 

v V fife*- K©fit btttiltStiZo . fi^l 3 2 6KJ; 
oT*Jil$n-5 4AAl W^^^yi^^lM 3 20 

fclofc»RU «F#ft$*tfcA*7 f -^^hy-^l 

3 2 Q±.\C®mt£ftttt>7 : ~ 9 bZitf. 

[0 3 3 8] HI 1 Sttlff-WbS^fcW*^-*^ 

-a 1 2 9 7 z'&wb-rz? 1291 £ng-r s 

»^^*>*I§IIS«I3I5H"C*S. fa-yi 3 2 l fix 

9 , -^y77*JPEG ! E-K (*— K2) 

Xhy-Al 2 9 7rtco^jt$tifc{H^7 J -?^^^ 

bcDTft3 2 tfy htt?*=--^lC«$ftS. 
{i^iya— ^1 2 9 0t?o— -r— i/3 >"f5cOi:(i3^co 



■5„ 3 2 t*y h<D4A*lW*W-r;Uf-yu-^1?-tt v -51 
fgft 4 ocoSSw^-^fblx-;? co 5 h<D 1 oS:WRi-5 
fciOfcfflv^ftSo #Jx.tf 4^4 |«©A*fcr*-fcA'# 
(3, 2, 1, 0) <DWU«Mfitt\tbtlX^Z>k1r 
%bs r<Df^-fe/v©n— >g VCOSISI* (3, 
2, 1, 0) (2, 1, 0, 3) (1, 0, 3, 2) 

(0, 3, 2, 1) a>4oaswe*>*. tti^n-T 1 - 

•>a ^Wft-^-l 3 0 4lot/V7Ta s f ^-fe/V'T'D^ 
^<U— >3 Xitl 3 3 3(40-CS>5o 2A^) 1 Hi^JCO-v 

j^yist-v-i 3 3iit m^—9—->a>mwt^r 

1 304t/-t'<l/->3Vffil 3 3 3CD3KR£fr5 
C i: X1M% 1 3 3 4 Sr3fej*i-S. K 1 3 
0 2\i-7;^zfu^^mmn^l 3 3 2££$-f5fc«> 
tc % f^-fe^oy^^S^-FtDfilttt^^iaSo ft 

tl 3 3 4®£^XM®£tlZ> 4 A# 1 WlWtH-f 
l/?tl3 3 OttW^bSitfcHiaf*-** h'j-Al 

2 9 7 i^W^flsSflfcttirtf*-**:/^** V<D4m 

2 9 8±(ctb^T f -?4r^i-5o 

[O 3 3 9]011 6{C*3V^, lH]!S"-CfflV>ib*"V5flgB 

3 b]W£MU W< y 7 r 2 5 0 K 1 3 0 2 
K-fcoTiiiR^tlSo ^S'^TyT'T-^/W-K 

(■=&- K0) fc v97-y -f^-Zf/V^ - K 

k i ) -cttgo^-^^r k u^mum^^T K 1 

2 9 9, 1 3 00, 1 3 01©ffMAG8 0 5 (01 

0 8) {CioT4^$H5o l^-^r^T-^ 

K (*— K 0 ) -Cf*^. h i^x^^-f ^ 1 2 9 3 ± 
(C^^-y^^i— ;H 3 8 0, 1381, 1382, 1 
383, 1384, 1385 (0121) te-ifcfiC&S 
-f-So ^ * !) /V 1 380H138 5 (012 

1) KW6Wt*ii*7 K wtK*a*7 Ku^ 
fi*«W(c:I^D-Cfc5o BP*?x ^> b V-V?/^* 1 2 
9 3 fi^§)5@K{c 1 o©i„o^a.^T KU^ i: 1 ocD## 

Sr^^-U^-v'a.— /H 3 8 O^Pj^l 3 8 5 (01 2 

i) ic4Ma-r5fc»Krt»o^j/^*ttffli-4. *-K 

om Su^^T KU-^tt^fBT 1 2 9 9 (0 

1 16) CJ:*)^bix, *KttK*ftU*V>**rt« 
7K^1 348 (01 2 1) t£#gB$*l£o ^SSSIS* 
ii*7K^1 34 9, 1350, 1351 (012 
1) tt*-K0T»ttttffl$ft*VY #tii^T K^tt 
^•«BJ»t3i*TKu^l 3 0 5 (Hi 1 6) 

hfl. *KWlcHEEite * y *^a-A' 13 8 0 

*>bl 38 5 (01 2 1) ©**a*TK^K:«il£* 

[0 3 40] t £ "Ptt, $>Wi>> y97y /r-^t- 
K (^- K l ) tcfcMt 5 3 ;uy ? 7 *j ■?7—-7;\'<nm&L 
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Ztf-T* 3o©x-://W#$l£Ky^ir*$ft3£t, 
ftf$r<k£h1tAtl l ?-?l&l 3 8 0/4>5> 1 3 8 5 (Ell 

2 1) %X'<Dl-^X(D?t*r)hi'*---Mcmmz.W£& 
*ft, |ot3o«T- ^/w^rti^tti- 1 o(D^ ^ry 
^*ri5&5£&£ 0 ^^D^a-M 3 8 0H13 
8 5 (02 1 2) ^©3o©-fy7^^, IP*>9^j£ 
^7K^|j;xH/-^f/^^l 2 9 3U1J; 

ft5„ CftfjCDR^^T KU*ht, rtgCa^s/^Srffl 
PT1 380H138 5©a£Jft**y MZ. 

h*W*te, ^fffft&^Sft UC1 308^6>1385tO 

^ftm©^*y ^v^-vwor Ku^tcgsgg§tv5. 
^OjgS, ^li/V;/ &"T y ~f7~ — ^A"* — K (*— K 
1) -CIS^SP^^T Ku-^ 1 2 9 9, 1 300, 1 

3 1 1 ttF^S5B^&^T 1 3 4 8, 1 3 4 9, 1 
3 5 0 fc*ft^ft#ffi£ftS. rt»K*ii*T h'^ 1 

3 5 Kl-Cttttffl*^*^. J PEG*- K 
(*- K2) TMftffl*iH*rt«l5TK^4j«*«fel4IJa 

[0341] Ell 19 ti J P EGEEIf SrtT5 JPEG* 
— K (*— h*2) IC&ttS, ft^^yj^zLl'—i/a y 

— ^«*£j*E](& 1 2 9 2 SrjSiSI- 
tfEW&WlHli&B-CfcS. JPEG*- K (*- K2) -C 
ii, «#^J«S1 2 9 2(i3y^y h%V>? 1 3 

4 0 b*r—9'*->( y$ 1 3 4 l<£>ttt#&, * h 
— ^7VW* 1 2 9 3§r^tf^*y — *S^ — /KD|*3& 

— *y h:/n ypliyyf 1 3 4 OttX hU-^q 
*lC*IASftTl^S, f^-fe/l^*— ^^ny^d»6iftW 
Lfca^-^^h^ny^jRSr^i-S. Way 
^ftttf*-*'^ h#!7^* 1 3 4 l<0W*Sr4ffiff* 

r &ft*. jwwmch;. fn/^ny^i 

*- KlC*iJt5rtttR*ia*T Kl'* 1 3 4 8, 1 34 
9, 1 3 50, 1 3 5 1 5 fcfhg£ft5<, 

#-^^h^oy^*«>V^Hi*7-fey hffil 3 4 3, 
1 34 4, 1 34 5, 1 3 4 7 £tf^f-£;fc fticffiffl £ 
ft, tfctiJTj-r-*^ h#?V*1341 fi^-^ffc 

1 3 5 4 ?r^-f6fc*iCffil/^n5o 
^■fcy hill 3 4 3 li-<-^S?^)i^T KU* 1 3 5 
4 Kflpg £ftfc 1 3 5 8 T\ AnJWfittfcfflJflW&frT K 
l/X 1 3 4 8 (fc6VM2 1 3 4 9, 1 3 5 0, 1 3 5 
1) -CfeSo ^*P^»-M)*7tybltt, #M 
^ * V * S?a -yu-eUtT 5 BII^W*a*lc*f LT- 

H-g-f •5<0lc:#^<bft£-<-*7' 1 3 5 4t>RI^ 



/<-f h*I7^©-ry^!)^^hft^-tUT«ffl$ft 
5o *C^^tt«5*a^«!6i-5«tC>f^^!>^^h 

hff^l 3 5 6ft, l|l-«!iEffl*9*-^^ny^*SjE# 

^*^7>'^ 1340$r'<V^IJ>y h-t5<D»Cfflt^?>ft 

[0 3 4.2] ffl^n-f-VaVffltBf 1 304 (B 
1 1 6) \t=>>#—*yYv'vy?$v->?<Dmt)km 
lrf—9*<< h#!7i'*wffl;W>e.St!3tti£ft, #ft«> 

h:/n 2/^^f>>^©ttJ*[4n— t 1 — -> 3 v^-^-fes/ h 
l 3 4 7&fl-J|-r3©K:fflv^ftS. tti^n-T'-VH 
^*IJ#Mf-J§- 1 3 0 4 fin— 3 >-jJ-7-fe y h 1 3 5 
5 i:-<— xWtfy&fyT YUX 1 3 5 4 ©fPWftTte: 2 fcT 

fl-gBSt&^T K^1 3 0 5<7?;tTft2 fc'y h 
K.fc9 4x.e>ft5 0 

[0 3 4 3] I12 0I1 pfaV7^ a l/- } /a ^ 
ttftMUVAy7r2 50K»iSil*:I-a^#-^ 
> hr-^^?>©#if--y^fc:' , ^-fe^x-^©S^ 
K/flV^ftStS locDT Kf*£j&8l 2 9 27?fc 
So Z.<DWT&, /<y7r»J PEG#ftcD/t#>c7>J PE 
G*- h* (*-K2) fcfcs. f-ay*- 
^Vhf-^7*ny7tt^s'7r KIMfl § ft, * * A 
x-*:/a^li/<;y;7x;^5t9W£ft5„ ^(D^-C 
tt, p^y*^— A"M£>##&*T K^(t 
3£H&L7?0-8|5##&#.T h'^* 1 3 0 5iaot^X 

»C»*fJ$ft*. rco0«JOA7jn-r-^a ^SOfPffi-^l 

3 0 3fii£fc:##&^T KW7©tTft2 tfy hirio 
Tiry h£ft5<, t^t/VA^y^neOIJ, 
Trt{t*&«ft$ft7T^5*-3VJK-^^ h^n^^^lb 
tttti $ftfc fcfticfflv^ bft 
5 0 !^i^V#!7V*CDttl7jte, ^Ji^Th*^13 

4 8, 1 34 9, 1 3 50, 1 3 5 lftCIU^n-f- 
isayWtm^l 3 0 4^^i-5fc»{CfflVN?,ft5 0 
-jRfCfg^^T K^tt, * h U-v^M* 1 2 9 
3 Sr«*-f-5-€-ftPft©*-^3L-A#Kl*ftoTV^5. 
rcD^JT*«, aE^i&^TKt'^fi^-aV'jK-^^h^ 
t>5'Nyf?^^ 1 3 6 2 1 1 3 6 3, 1 3 6 4, 1 
3 6 5&SVM21 3 6 5t'U Y-iVrvZ^l 3 6 1 

^>-h^ns/^-r^7 ? s' ; ?^^ffi[1:5fc*l-, 
y h^WTJf^-fe^^V^Ot's/ h 3 £4 tip*. 6ft 
5. — JKfc^-fc y h 1 3 6 6, 1367, 1368, 
1 3 6 9 tt-tft*ft©R*&*T KU^-e**5. tf ^ 
-fe^^y^Ofc-y h 2^bt*y h0(iR^a.^-T YV 
XtDsU h^yf ytXl 3 6 HrfflvMbftSo K*ii 
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7f7 KMttHl 2 0»£*$ft3J; 

^•Vh^ns/^^vx^^^l 3 6 2, 1 3 6 3, 1 3 
6 4, 1 3 6 5fe5VMSl 3 6 5 b'H Y^Vryt* 
136 10fi^Oft*T?*)5. rcOtfljTtt:, tti^n-r 

- > 3 ^ftfffffi# i 3 o 4 tt, *K tt/j:3E{b* UK tr* 
ft5„ -f^y^ymti 30 $i/i> 

* 1 3 6 OIlf^-fe/U^iEf £/<y 7 r WSft 

[0344] Bll 21 fi^h V—Vfs^ 7, 1 2 9 3 <D 
imtVhio * h u-iS^/U * 1 2 9 3 Hi 1383, 
1 3 84, 1 3 8 503OC4 fc*y t-<74 
a-^H 3 8 0, 1 3 8 1, 1 3 8 2©30(08 tfy 

*S?a— /H»J|l— /Uy^Ty^y— ^A^e— K K 
0) <D3 6 tfy h<DV- K, #l^y?r?^-^ 
K K 1 ) O 1 2 X 3 K"y h<737- K, J P E 

G*— K K2) lcteft.53 2 tfy hcof ^irA'fc 

A'tt«F*ft:S*LfcA*a^mAf , -^5i MJ-A (12 
96M 29 7) 0**5«^K:nil-3«- 6>*tS. fc<b 
Xfcf , ^ * !> * 1 3 8 0 tt^F#-fb$tvfcA^X 

hJJ— Al 2 9 6 CD If y bO/S^tfy h 7£g§fft 

-Al 2 9 7CDfc*y h 0^6, tfy h 7Kg^£ftfc:r- 

Ei 2 i ic^fts^y 

/KOR*a*T KM 1 3 8 6, 1 3 8 7, 1 38 8, 
1 3 9 0, 1 3 9 1 tt«SJW&*7 K M4*» 1292 

kft, *n<o*iiro##ji*^raim-^tti--4rw4 fy 

fctcfflvkftS. 

[0345] il2 2l^h \s— 9 Sf*4 * 1 29 3rt 
^1 3 8 6, 1387, 1388, 1389, 139 

-f^rco^a*- Kfc&tts-f^-cco^y^a 
-^o##ii*TKM»*KWk:ttia]t"C*9» 



I'-v'aynriiSMUVAs'yr 2 5 OCDtti^-r- 
h y-Al 2 9 8 f*JcD-t--<TtfV^ M3J PEGBE 
lOJPEG*-K (*— K2) ©/^7Tt*iBSil 
Ti^S tr* Ir/Wx-y-^ttffl^ftfcijifta y 
hy*— y\ fc-SW* J PEG#ftc9 J PEG^e- K(7V< 

a $ ft tr * -fe /pf- * 4 < r «4 h 4 v\ 

T— ^K^M" 5g#te^y7r~<04 0©SfcW&^T K 
M 1 3 4 8 , 1349, 1350, 1351 
<ko-Cf8fc£ft5 0 ^y^TyrT-^-K 

K 1 ) lc*5^T{±, **: 3 o©!**^— 
y7 7l£**n$ft, «oti*3oiwi^7K 
Ml 3 4 8, 1349, 1 35 0iiS3o©|ir'-^ 
MZ4 >f y y x droits tziblZi&mx-h ■&<, ~t-<X(0 
-/K0«S*&*T KM fi^-yw y * T y 

t'x-tVut-k (*-ko) co^tiwic-efcy, st 

*ii*7F^2 4 8co^^S-co^— K-CfflV^tvS. 
01 2 2K*$*tTV^W*iaiSOWtt^ h I'-z'f^ 
-I"* 1 2 9 3 5 6 o©^ * U * M 

ftCO^ii^T KM 1386-1391 SrW-g-fSfc 
/^y^rcQgyi*— Kfll-§-i:**4o©K*ji* 
TKMSrfflv^5„ SE^^-TKMi^l 2 9 2tt 
Ktim^rk LXftUT KM/^ 1 3 4 8, 1 34 9, 
1 3 50, 1 3 5 ia^&S^^&^{if-£t>1b 

v\ ^ H/-i?f/^^ i 2 9 3*«*1-*p«*y*s; 

a-/KOrt»K*ii*T KM 1386, 1387, 1 
3 8 9, 1 3 9 O^^-r^c 

[0 3 4 6] El 2 3tt^y77 2 5 O^-zV-y^T 
y^-yju^e- KlcfcS^ir, -ifcoi 5 tT 2 o t* 
y h©fTWflMR^<y7r 2 5 Oic^^HSco^^^ 

3:^3— t*^ ^y*«ii^fTfcft4V\ ff?Hfffttt8 tfy 
F^^y^a- ;Ui'3 8 0, 1 3 8 1, 1 3 8 2l;ft 

m$Hz>„ 'nmmk<D try h 7 tr y b 0 km * y 

i^3.-/H 3 8 0iC*&^§^, t*y h 1 5i&»e>tfy h8 
li^ * !) * 13 8 1 Kfti* ft, fcTyM9^ 
blfy h 1 611^^-y /H 3 8 2©Tft4 fy 

/< y 7 7 fcJMft $ ftfcx- ^ ^ b ttOT® t>»9 

W $ ft 5. y#Ty 7 p 7 1 -7'/P^— KK *3 ft 5 , 

t^TW* D *^i-/i'0|l!^t*t^07 K 

[0 3 4 7] HI 2 4fi#a^y^ry7*r-7*^ ; E— 
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fti0^fr-/;Hi4 try V<d4 /VMI £ 8 t* 
y htfVH&flt&fco. il&-f A— /MII44 tyh 
<7^*y*i^-/WC&*ft$*U /]MRfflH*8 Ify btfM 

^14 10, 14 11, 14 12li^*!J^y^l3 
80il 3 8 3, 1 38 U 1 3 84> 1382£l3 

8 5fcffttt$;|t5. 5Wta*%**-»»JWt^- 13 0 6 
t 1 3 0 7 (012 1) ft* h U~ 5^/<Y ^KUWIS 
ftTV^/hifefHCjefftf-ffc:* h 2 9 

[0 3 4 8] 012 5tttf^-feyv^— 9-7-ay9*W— 
31*7*-* 7 a y y jpe G*- K (*- K 

2) OftficoS^^^U--: ^ 3 ypJ|g^MUV^ 
■>7r2 5 0Kl^<Dj:5fcLTtry\ir/I^- 
SnSO^Sr^Lfcig-CfcS. * h >J<T'HX 1 2 

9 3tt, 8 tfy h^*y*^»-^tpi«/i*ft-e«^ 

L-TSfc>tl5^^-y *v?3.— /K 1381i:1384£r 
^*!)*yi-/H 3 8 0, 1 3 8 1, 1 3 8 2, 
1 3 8 3, 1 3 8 4/^£)&540CD8 tf y M*y^y 
y i L-CM*3$;ft3 0 9 — /W 3 8 514 J P 
EG*- K (*-K2) TrttttfflSftftVN, 3 2tfyh 

[0 3 4 9] HI 2 6»41£— =>-3#— h*— KTfe 
£7. h^yr^^l 2 9 3\£b*<D£ oKLX^-— = 

@-C&5„ * 1 2 9 314, 8tfyM 

* y *^ a-A'i Ei«*#8rc»te * y 

/K 1 3 8 1 k 1 3 8 4&^fry*y — 
/H 3 8 0, 1381, 1382, 1383, 1384 
i>64S4oC8 tfy Y/^V'^i/tb IsXffimZti 
2>o ^^yt^a-A'l 3 8 5i4J PEG*- K (*- 
K2) ^ttttffiStlfcK 3 2lfy VtoGWZMtV 
^t;Hi4o©^ H:^^, WPfttfafcSS 
Ify h<D**V*Va--M£&tti£tlZ< l Z.<DW&s W- 
)>7ny7f46 4/W b*>e>fc-5„ ¥v> 
5 =i yjft— j^y F^n^ ^sffi/< y y r 

I4'<y77rt0>**'5¥— hf-ii'^Dy 
[0 3 5 0] i.flPSBftffa^-r ifal/-: >/a>*!m 



ftfl?;^ y 7 r w&S*- Ko-tft^ftlc&v ^T&B t 

¥-/Vyy7s/77-7/W*-K (*-K0) 14 
■«««lc*v^T, ff ?'J&3&£^ y 7 r tefclfrfa ©K: 
fflv>ibft3o #l/W5'?7y7*f-^*-K (*— K 
1) T*tt#f-ir*/i^>fe5SIW*ift (CSC) 
$©^y*-/Vv;Rtf77y^3 V^T-7*/^<y 

7rKHfttt1-<5©lifflv^«iS. JPEG*— K(*— 
K2) I4J PEGEi, JPEGttfiWftitfc^ 
T, MCUf-^!:8X8<D^-3y*-^yh^Dy 

[0351] 3. 18. 3 te&X—Jfi-J if 
MUV/%77 2 5 OttiB**— 2 4 9Ul*JV* 

■CtfflV^feHS. e*t-^f2 4 9IJ, y-^V^ 
-^2 4 2fc5^f4J PEG=<— y*2 4 

-^24 9^*^, H4 2"C«WLfc8*? f -y-«>Bft, 

7i-X3yhD-7238, D-A^^Dayl-a 
-7236, Jl>a-f >-^-73i— y. = ^ hn-7 2 3 7 

[0 3 5 2] JPE G#fi*- KOBf, #7--f 
-y-*24 9|4MUV RAM 2 5 0£rJ P E G 3 — y*2 4 
9cDW®x-y-£y*7/w^7 7t-5fc»Kffl^5. y* 
7;W<y 7 7 (4MUV RAM 2 5 OO^KStii**! 
"CVS J PEGa-y24 lW7-?^tJ PEG 

[0 3 5 3] 1, 3Rtf4^-V*/PB«^-^»» 1^- 
=f-^^/V^h<DS If y K0=3>;K— h<Sr-^tl , 8 X 8 

-^7-^-^2 4 9(Cj|$n5o tt**-^^Hfttril 
60^ny^SrflME©JB#"CMUVRAM2 5 0 fd&i&>3 
U *^^+^KD^V^-y-7M^cofc4t)(c, 
7-^?:MUVRAM2 5 0 A^^r^^rotl^ 
^fCO^-y^/KD^ yv'^4-tS^1-? )o CT^ff, YUVtc 
J:^3f-+^KZ)J PEGJE^-CfiJ P EG 3 — y 2 4 
I(43o08x87ny^$r, ^*|CY, ftlCU, ft^ 

[rvoii-ctBTj-fSo p< 7'>aW(«n?ii7"n y 

9 *» 1 o<D a y jK-^ y h &® <o ttj-f E. i i o -ctf b 
If^-fe/V^ (YUVX) ©Jg-C«*-f5. r^-CX 

Y^^OX 7 y 7tfi&mb *ofc 1 1 KlfTfcftSe IS* 

-^Off«^(0fc*O!^S^f-7f-v7y yy'&gfcfT 
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[03 54] mi 2 7\Zi>XZb®2<Dl&%*—#i-<< 
D*24 908¥*ffl#jj*$*VtVS. #-?Mif24 

/I'Sr^tfii^-^SpC B u s^V^-7x-^84 OJ1 

i2i;:g«t£:}oVNTv>3„ ijg*^-#^if 2 4 9©&a 

{i^^-fe/^-^-r-if 2 4 9£[fiHiTfc3^, !M— 

X^-^tftft^fTttfl*. ^-^tWPa-sy h 8 4 2tt 

M\tm 4 2 £#jf. LTMifi©:! *> R9J $ tt, ftSB 

ftS, F I FOdf-a— 8 4 3tttH^f*— ^Sr-t^RB 
u s Ma. = y h 8 4 4 § fl5f IC/^ 7 

X&fT5o RBu s ©J|p3.=iy h 8 4 4 fiT F^7 3 

it, HK, rtS5RO_C 

5. flniT, RO_LMTu^*^teffl7^fflKtf St Pit$ 
ftfc*g ©ft©7 - * V NT** v n < ocox- * Jg I # 
ffl#$ix5a»£«:3£'f£. MAG 8 0 5 14 J P EGM 
HCMUVRAM2 5 0©7 K ££f&1"3o MUV 
RAM 2 5 OttJ PEG3 — ^ibWttJT^^^^/W^s/ 
5fc&fCffl^?>;ft.5o MAG 8 0 5t4rtfIS3^7 
^al/-i/3 y^?l;ft#f5MUVRAM2 5 
OlC^ltS^^— -^^ h<D* yS'a&frVV fc^-typ 
cDAofclg— f^^/v, 3f--Y^K 4 ^--r^/K^tti:*) 

mw£Wffi\z&mt%mz&mtt£Z><n-Q. muvra 

M2 5 0a»fe#&ft35*— ^ttf*— *»ffc3-=y 
UTK$n-5„ ffl$fk$'~-tfT''( if249# 5 JPE G*— 
FIC&oTI^&Vi IIJMAG8 0 5t4WdPB u s U- 
8 4 5 cOx-^^Tt'-^^^-^ y V 8 4 2 \Z? 

jut hic&s. 

[0355] 3. 18. 4 ^7VK^WfB 
&lfC 

if 2 4 7 t 2 4 8tJf-^*+y>i3^hD-^24 
0©7-^^S'77fIii:, JPEG3-^241fc 

O 0 3j--<5 y Y*-%1rs( if247£248 li^l* 

KT$#$ti5. 

(a) */<?yK*-jtft'ffi J CBu sK*tft^ 

(b) ^Sff)f^(07-^!4 s t-<7y KP^?<0(*i 
U * lift* £ HT ^ 5 B^E^e- K 

(c) ^r*— ^-<D»«S5^-4ryj/j|r/i^r K 



[0 3 5 6] ^ -< ?^2 4 2 ©#13^— 

K^-^7-'ff i B 2 4 7K*jrt<&, ^Sr^tf^ixe><D 
KH, l^»-f^-V , *fflV^^$H5/'!777 
fc^irATi&STfcSo t^7~/ K^— #-7Mif C 2 4 

8 Heaven©?*— * *;wfe<nmmzfi 0 

if B 2 4 7 14 8 t* y h ©fT?IJ&$©'< y 7 7 & 

Y3r—if1~'i if B 2 4 7 riSiCMm i 3BISB»-&^^-0 W 
^<D=f— $ <D'< y7 7 5 o 

(d) Jfe**-K"Ctt*'<5i'KsJ— Jtf-TMifBtt* 

* Id X o T JtJg $ ttfc EHfcftig-f - C £ «rff 5 . 

(e) i'/f^-KtB^yK*- ^f-7-^if Btt 

(f) KTtt, ^7^K^— ^'fif 
ti7-^t^>3.ayFn-7l:MDP2 4 2&5V 
(4 J PEG3-^2 4 1©t K^5r^ htcSSSI 

-rs, 

[0 3 5 7] F*JS5S$ U^^^H:, ^ 

*^if2 4 7, 2 4 8©ffl*"C^SJx4qiB©«:Sr* 
Si"5o t^^K*- ^^if 2 4 7, 2 4 8-ttt^ 

y^^h^fU, Kit • #£E« • jEttflstttt. i'fc 
v^UTi9««!is*»S. S***ifcfcarirt»w^^ 

^K:*— #*-f1f 2 4 7 t 2 4 8^MWJ7-^51 

[0 3 5 8] Bl 2 8m ^VK*- ^-fU* 
(2 4 7, 2 4 8) ©J;0»Mfc*flfcaS^$ftT^5. 
**<9^K*- ^-7-^if 2 4 7, 2 4 8ttii^O^JpC 
Bu s^y?-7x-^t^7^Kt-^fift: 
<&«»*s]$i'^;**8 5 0$rgrtf. XI-, obus® 
l^-y h 8 5 1 t±x- ^t'7^3>hc-7 ICS 

K^fe*, ^7^K^-7y-7--rif 2 4 7, 24 8© 
OBu s J *-x<DmtS&'sim\z1r5m®iB%- 
A73^- h y-A(Dii*(D^n y^if-f^/^tj 

iT-tm'fx.-v hvfflW&ft?, *^<7 v 

7--1'if2 4 7 > 2 4 8^-7^^, feSV^It^'f' 
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h 8 5 1 \if~9n^^y ! —9^- J ry^3-^^Yu — 

[0 3 59] *n^tb©*^5 ^ K*- #-N-f 1ffi35 

V hn — 7 2 4 0*>f)©r-^^S'7rt5f:ftlCffl 
1/^*1/53 6 fcTy MgWF I FO^-/7r8 5 2£"£ A/ 
■CVn^o 5*-*»ftS3.= y h 8 5 3 ft, fc^-fe/l^-tf 

4 6 CD^-^a^a.^ y h 8 0 4 Ct)5t58 

[0 3 6 0] *<{>"f— 9'<*/} PEG=- 
-7x^8 54 ftiif&ffi^-- K I^T / yf V*f— 
^ J ?>JPEG3-y ; e^-;U2 4 2, 2 4 1^ 

J My?-7i-^8 5 4«f-?gf^=7 h 8 5 
ro^-g-fclft, 3.=- y h 8 5 1 , 854 \t.7 : —9*r*? y f 

3 = yFc-7 2 4 0 b^MWr—y'^<D^r^±^ 
[0361] 3. 18. 5 5&r- 

16»-t-5H*^ , ti-feyf-K:BBi-5t>«)-CjE>S. 3Efc, B 

V \£ a - * 7-* t- * ^ Y 5 C £ * g ft £ f 
■5, B&:/n-ttyf-ft, nCo^y^Srfc< §A/ 

y t° a. - * r - * t 1 9 -r * 5 r £ £ t B ft £ -f 
5. 

[0 3 6 2] ^t^-^T-^x^^ftllsWUv 5 * 
St/n^, *f-9*-f !> Jx.9 Y7u± 

y*97n yt\*ww8 t mwimz.®i-z±x0ttnm 

M-TZ. *#7ny*ftflm«r«J*{f#fc*#U ATI 
7 ! -9*7'i?x9 h^y?-7i-^S:«^t5 c A73 

—9*-7V=.9 h&7 ! -9*7'V*9 h^n-fe yiHuE 
h^V^-7^-^^x-^^-7'v J ^^ HOT K 

^KfcS. f-^ah/s;** h7°n-tryf-ft, gftflo 
ttf-9Hrzr^=.9 Mt»LTjMMWKrfr$. 
«MWns?y^Ht, =T-9Hr?y=-9 YVm^^v9<n^ 

[0 3 6 3] »K, <f-9*7i?=.9 h T'n-fc ylfft, 



«*.3££#T?t, fc-ifT'T'nlryf-ft, At)*f—9 3r 
7i?*.9 KO-»*«kai-«. ?*-9*7*<Jx9 bV7 
^n-fey^ji, ?-9*Zf*S*9 V<D%®UftlCttL&. 

&®m&mn\mnztitz7 ! -9*7'>>*9 b© 
>u * h*. #affi©wifP*5 ± o y * ft. at* 

So 

[0 3 6 4] «f-n^y^a»f>0*««f-tt^SB 

[0 3 6 5] E12|C*5V^T, HBfiLfcAjK* '±9*-^ 
^*SB2 4 2ft J PEGf-^?ffW©4t©f- 

fig, fe^w^^> m&m&. imnmn, 

■vy h©«** J ^**t<5. if-?/^2 4 2IJ^t 
yw*— If 2 4 6 *3 A tftfv* y y 'K*— Jtf-f-f U* 2 
4 7, 248^t, t^tAit^i' Y7 t —9b&f: 

[0 3 6 6] 012 9ft, ±T-9s<X%f>2 4 2<737'n 
y^HTf&i. 3fe7 f -^/-?7;gi52 4 2ftifl J ffl<DB^7 P n 
•feyf-CfcoT, ATJ-fy^-^^-^l 4 6 0, PH& 
y'-^T'a-feyf-l 4 6 2, ^!7— 14 6 

4, Ktt-WH 4 6 8, MmU&rl'Vx* 1 4 

7 0, l/^^7r-l'/H 4 7 2, JSiO 5 , ROM1 4 
7 5£f|;i5 0 

[0 3 6 7] ^^S!HBg|J2 3 5ft, 1 4 5 4triiC 
T, KSr^!7— yu¥x9 1 4 6 4^t. 

^^ix<D^!7- Kft, ^t?-r^# B^aft^coa 

ftif©flia*-&A/T?V^5. ^y-Kft, /U1465 

SrgSfC^7"K«-f-Hi 4 6 8fcsr*h,5. *n 
•c, #^fsij»gp 235ft, $t*y- vtiwt* x 5 \z 

©m^SrSft^i:, ^^-«#Sil 4 6 814**7- K* 

«aaim*jca*-f5, **ia>6, c^fco^j«t^ft/-< 

^ 1 4 6 9«:Sffi LTM^ft-tyv 7 ^^ 1 4 7 Ofcglf 
7 1 SrSS tTA7J-fy^-7x-^ 1 4 6 0*SJ:tfiI 
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[0 3 6 8] ±y : -?s<xn2 4 2ZJ:'9§kfc&<0&5> 
fc©fc-J-5fc«>t, 4HMMiNlS2 3 5/5S©JWf#u^ 
*l 4 70fc£»*t&tf::iVCSa. micfco 
T, ^-^/^S?2 4 2(D«itSr3»ftlU-CV>5lfC 
t>> %¥—9'<*U2 4 2dto»V^^SrtTX.5i 5t 
49, ^x-^/^^2 4 2fi^y-KT*E^$nT 
VMfebMMMttiUMEf: <bHfr 5 <fc 5 fc&S. 

[0 3 6 9] 9fa©Hffe»a»ff«r3lfi : -f Sfcft^jKS 

#P§B2 3 5f4, *OlR3S-C#*V^Rj!c^TOfl(f«l*rl' 
^^77^1 4 7 2 0V^<O^(0g«^fcl^^^ 

&£LT, A^^^?-7x-^ 1 4 6 0J3iO ! It7 
-*:/n-fcyiM 4 6 2lc£*.fe*bS. fcSWflMQStb 
fftc&vvc, AWy?-7i-^ 1 4 6 0f4±^-:? 
✓<*ffl$2 4 2©$fe#tmSrR»fc-fSfcJi)lJ:» 1^**7 
4 7 2©»tfh/fct'^**©rt*&I*rlN&£ 

[0 3 7 0] Koa*#IHU •JWf-fri'S? 

SB2 3 5l4±y-^;*$|S2 4 2 (CgfaBft«ym^(0 

<t, KM-># — 7x.—X 1 4 6 0{±a<* 14 5 
l*»6©9'-**^a^ h£gtt&9&*.5, ATM 

y^-7^1 4 6 011, Hff**L«WflWyi»f^o 
hhk/Sct, t^?yK^ 145 2x\i^yyv 
^xi4 5 s^&o*^^ Kf*— ^SrSttBtOteafra 

?VhV^ 1 4 5 2X^7^^ 1 4 5 3*>f>CD 
t^y Ky-*&att»0*&»$. AW^-7i 
1 4 6 Of*, MWt&l'StXfi 1 4 7 OWttiTJlOfc 

**»fP*J J;0«a***o «t 5 5 i A 

1 4 6 Oft/** 1 4 5 2*5<fctfl 4 

[0 3 7 1] Iftf-^yntyfH 6 2 ft, A73^f 
y^-7i-7 1 4 6 oicia?ijtitfct)fcofcf r -^ 

*iv*9 b\zttLX3m.ffim&*?To, m®7v±v 

tl 4 6 2ll ffifeVl&my T9 9 XWoftZ 2 OV>¥ 

b<D?|lg, XtJ tJ t©tS*ft2 5 20©f 

**7^:=* h©^»SlJK:»i-S«* ft 
T, ¥—9*-f*>=.9 h©tw<7o-S:jb5ft^tt 
(,:, tuf-^#7'yi^ F©7yy7D-Wft 



Zb&XZZ. /<* l 4 7 iwfMWff-14, lffl1E©#fl¥ 

[0 3 7 2] ROM1 4 7 5H, 8. 87t-7yht? 
§1 1) it t^fc 2 5 5 / x ©»|!t*tt5i}\ 
T\ xliO/4^ 2 5 5t.X<D$LXhZ>„ ROM14 75 
tt, ^^1 4 7 6 LT, ATM ^-7x-x 1 

4 6 0d3«tO f H»'r— ^yn-feylM 4 6 2\Z&jfc£tl 
5 0 ROM 1 4 7 5lilv^$©7 r wy KS:4^U ¥ 
-9*7'i?=.9 hiC 2 5 5W, ^©IS***©^— 
9 *7~V=. 9 Y 1 v > 5 £ 5 fclWtefflv* fctb-S. 

[0 3 7 3] ^5 > hV**, fJxti 1 4 5 2 C7>^« 2 

ic*ijpg$H5^> ±&%t<Dm&&mW}mct$^x\*+ft 

XhZo 113 Oft, A^^-7x-^ 1 46 0Sr 
«t 9 P*8(C^f- 0 AWy?-7x-^1 4 6 0lt x 
— 9*7*Z?*9 h-4 y?-7x- *g|51 4 8 0, 
V F-f V* — 7 a— *gfll 4 8 2 3oJ;tfl 4 8 4, 7K 
l^^4*««MB I 4 8 6, 7 U> K&SfcRUM* 1 4 8 
8, 1T5ij*JM**»l 4 9 0, ttlRlftmil 4 94,f 
— *|SIJW«B 1 5 0 0, gffiffil 4 9 6, mu?7>9 1 4 
9 8, IVK, ¥—9$mn*Sv9 1 5 0 5 
[0 3 7 4] ¥—9^7' i J^9 h-TV* — 7 1— XgBl 
4 8 0t, ^7>^>9—7s.-~ 4 8 2RXI 

1 484i(4, ^ISP^e>f*— 9*7*i?*9 YRU^y 
VKSrSftStS, 4>9 — 7i- *S151 4 8 2, 1 4 8 
4ft, 2oitMai^ 1 5 1 5*»t)OffilJ*lft#fCj:o 
W V^-7;n-^g|5l 4 8 2, 1 4 8 4 
ft, git^ofctf^Wy 5 -?:*:/:^^ 
Y^ts¥—9WJy-9^UK^\,Xt5'0. 2o£t> 
ttlB¥>-9l'i>Xfi&G%lX'bZb$ tiVAL IDff-^- 
^WTJ-rSo 4>9 — 7i-^l 4 8 2, 1 484(7) 
^-^^^©tb^tt^-*/^! 5 0 5tSaR$Jl 
3„ ^y^-7ai-^gCl 4 8 2, 1 4 8 4CVAL I 
D«f«7n-^1 5 1 0\zmU£tlZ o -t^yW 
Z7*y¥-f%Z. f )[C®l&iShtzb%s *^7>V4> 
9 — 7 s.— XgBl 4 8 2*5«fctFl 4 8 414, ^Hfg)5 1 4 
■'9 6Hro7K^i, ff$lft*WNI 1 4 9 0 t , X 
—fiTT'V^t ls9 — 7i- ^SBl 4 8 O^x— ^ 
Uv f ^.^<DW^i:«rS(t©9, *©f»T»^RftT 
^$iJ»/-<^ 1 5 1 5 7^bcDftJ#p{f^iaSCTiim-3. 

5^d5*V^#^, t^7>W^-7x-^14 

8 243«fc{>'l 4 8 40f-?^^lt 

*9 YJV9 — 7x.—*m4 8 0*fcttJMf»l 4 9 

[0 3 7 5] TK^4jttWWl4 8 6tt, 77>fV 

^^H-^-TSo T Kl^&JfctKHSl 4 8 6(4, IH#^ 



-74- 



* 1 5 1 5(DSTARTff^5fS:g§tv5r t4r#o 0 
M®'*X 1 5 1 5 (O ST ART 'fe%ifiWCfe.ZtlZ>t* 7 
Kl^&fcWMH 4 8.6 ft?- h4>? 
-7x-^14 8 OfcttLTSTALLflTf-fcAMfcL 

7 K^&MMHIl 4 8 6ft. T KU^4jj6Rl»l 
4 8 6 ^ 7 ^5/^1" 5 r tS-^gi-fS*-*^** 

7VM^-7i- xg&l 4 8 2*5<fctn 4 8 4©f 
— * U^** fcflfcl^** 1 4 9 8 ?VWf 

7*-9ir7Vx.9 Y<<^9—y=.—?>%\ 4 8 
Oi^VAL I Dff^#ig»£;h,£<t, 7K^4^ 
fg^g 1 4 8 6 ft* -7 ZWL'PZ^Z i 5 Kl4 9 , 7— 

[0 3 7 6] j)<?>#frmcmt%t, 
H3§1 4 8 6lif<7yK-f ^^-7^-^*151 4 8 4 

fe*it^7 yMy?-7x-xai 4 8 2«c^ 
i-So ft*S, 7 Kw^^^ffiSil 4 8 6ft, ^rofci: 
?i]<Oict ^tb^tl^o 2o©* K1-S. 

-7i-^gi51 4 8 2 ;i>>P)£>STALUf-5§-tCj;o 

ZttifiTZo LT, gflfgJ51 4 9 6ft, 

3t, 7>r7V^ftfT<!:?iJro^SVn-KU 3?fif&l 
4 9 6 ft^<D^J©&±«^^-r«t 5 «H*£$ft5o 
[0 3 7 7] t'^-fe7KD*<o«iSr^"t-5/cfe^||rfl^ 
tefrftS^, 7h^^Wl 4 86l42o©7 

rtVft 1 My Y%V>?&&.oXmft£ft, ^CDffl^7ft 

w-£**-^$;h,fc&, ^^fts$*-7^^^ 

"t, ^ot^yr-y^^r-^ hV7,]j!^7y 

K-Y>^-7^-^§Bl 4 8 2(C^btLfct#, VlM!& 

[0 3 7 8] :7V^R£j*tftiS§l 4 8 8ft, ^ffig&l 
4 9 6£fS!|#Pl/-C, ^V-VK*$<Dfc*<D0^P>2 5 5 

7 ^ y K*T« t <DtA Zffimir Zffiffl 77?fkLX& 
frftSo 7 w y 1 4 8 8 It if*, 

K (^ty^- KXftx^s/T^-K) T-Htrt^t 
•CfcS^SrfcftSo ^i^>-Kft$^2 5 6^T-C35-5^ 
£fti^y^e- K^tefrti, * 5 -C^i^ft^r v 

[0 3 7 9] T'L'y K£/fclKli££ 1 4 8 8ft, TIEcDff 
^-MtI>\ t<oe*Srl^-y*7,^ (r e g 0, regl, 



reg2) \C±yh-rt>o ^7VK7y/i!f 
ftfcfi^Xxyz/^-KK&S^, 5 1 l-fi££ 
regO (2 4fyH (C, 512-2*f^reg 
1 {2 4ifyh) JC, LT, mT-fflteZT e g 2 
(4X9t*yh) fc, ^riX^75/^-rS. 
•Y>7°^~KlCfc5^g-ft, OJrregO (24t*y 
H f£, 2 5 5/ (6$-l) hegl (2 4t*y 
h) (C, ^ LT, i^T-W^ reg2 (4X9fy 

[0 3 8 0] ^y?*— KfC&VvC, ^T^^S^^g. 
IM^Mcfc^TJl^ftSo regOOfWi: 
t, regOtdregl ^Dx., -t^** regOt 

5. reg0g0t*>5if, r e g 0JC 5 1 0 SrJP 
X., r e g 0 {C^i"?),, -l" X :? !M >- ft 

if*D§it/£V> 0 ^y^y^y^fflffl^^yyiffc 

[03 8 1] v'tyT't- KfdJo^T, SiT<D{tmtf& 
?Mcj3\,^xm?f£tlZo regOCregUiO 
x5„ *0^WtU^ft2 4 tfy 16. 8©BS 

^ y b-cm^$ti5o itrlEJPgW^^r e 
g 0 tC^-TSc #&fe£:<Djg 1 Ky Y jfi 1 T'feS© 

fi8t S 'Hj:7y7'ft$.5; ^COyV^fx EP*>r e 
g2cDffl^i, ^i^^KM^&lttftiB^'r-^^niry 
f-1 4 6 2^ib^ 7^^*^1--5„ 

[0382] mm%vtm& 1 4 9 0 ^^^j^^ 

ff5. ^fT?IJft4 X 5^7C-Cfe5o ^1^P>^4?IJ{C 

-Cfofl, ^JiJft«»|q(c^Px.e J tv5^gr^^^^A/ 
•CV>5. $IJ^^1 5 1 5£>ib<DSTART{a-5§-#g», 
**i»fci#, ^J*^ffil§ftWT(Oj; 5 KSK,, 
[0 3 8 3] 1) /^^.1 4 8 2&tj!l 4 8 4ri>ib^&*T 

fc\ ■ftill^** 1 4 9 8Sr^^-^;aT»*#i 
T-t 5 J; 5 K-r3 0 

2) 1 tfy b7U yfyuyy^^x^X, 7-fV# 
fSr^lt, AX 1 4 8 2*3j:tFl 4 8 4 2i><bfT?iJ<7? 

b^tS^t t<o«rS^-T5 "MAT S EL" fff-£<b 

[0 3 8 4] 3) x-^:/v^ ^'y^-7i-7. 
SB 1 4 8 Ori^A^ftST*-**:^^ h^^v>t 

mj-rzo 

atBStfl&Hl 4 9 4ft, ^-^^^i^ h©7K¥MK 
SrfrPo *¥ffiF^<i*JV>T, ±x-^/<xSP2 4 2ft^< 

X 1 4 5 l^^,X-^^-^n:^ h y-A^SftSt 
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■K m<D7-9*79*9 h©M«r*HB-i-5. *LT, 
7G.X MJ-A<75 2{g, Xfi4ffOg$-efc2>v :r -^^-y 
S^a:^ K©^ M>— f'— 9*79*9 Y 
YXit ¥9±Mc/< y9 £fh%Zbtth*)'&Z>1t 
ft, ttHttflMl 4 9 414, *Ar-7y Y&m±\ctj:5 

l 4 9 4TOT©J; 5 (cfSf£-f5„ 
[0 3 8 5] 1) I NT_S EL(t-^Sr^-r5r 
«toT, *?-9&&v9y9 1 5 0 3tf^t)7-9*7 
9*9 h£S6B?iJ-f5<fc 5KU jELI^t*— 

2) »g5f S^-^aJ-^*^ H#«)R|S:«llli-5fci{) 

[0 3 8 6] 3) 7~9*79*9 Y<<>9-7*-7. 
Ml 4 8 0/55% 57*— 9*79*9 K££»4A*vfe^<fc 
pJc-rssTALLft-^Sr^-rs, r*t*si5Si:$n 
£&f&f4, fflTJ* hy-i^A7J* Ky-AJ;9Sv>^ 
fc>T-;fc3 0 STALLff-^-li^n-/^ l 5 1 Oltgfe 

T*59, HPJIP^I 5 1 5 0WfPfl|*KJ;oTfl|J«*n 

[0 3 8 7] 77>f >wmt8:3s£xm&%-mmz.&^ 

1) *0>x*iJ:tfy«£««:fUM-S. xlS**!)-*-*-* 

fcftic, jmhasi 4 9 6H0dpjHiSrfflv^rgta«>xisai 
ic^ts^mm : f^9<Dx^^m±^tK 

fflv^SS©xlfi>t) 7k¥*J «fc tfSitT^w * <z> x 
«ril< J:5fc1"5, yft»«:tt*-f-*fcftt!:, gffi&l 
4 9 8HdBj|[«Srffl^T.gWE©yjK*k:*spXttfia9 f 

[0 3 8 8] 2) yffifllfc>f yfynf-7/i't7t 

3) ■*.W®&4^7 ! v9 *7-—7)\sx.ls YVKMW- U 

[0 3 8 9] 4) S$*7V|-^t,U?|< 0 

K^tc:43i>T, ffffsv 1 4 9 6 ttfiJlT© 4 5 fc 

1) *7y7*-YlClS^X, *>5lo05^^»lSr« 
[0 3 9 0] 2) Kfcasi^THu 9\>7 



3) 9*y7*-Yx\t, ^jR©«9«f-ca«fTfc>^«. 

7*7VK<0|JM&£3I<. 
[0 3 9 1] 5) £3#!7>'ba»&l«:§l<. 
<feu^*^ 1 4 9 8f±, 7—9*79*9 Y4V9—7 
4 8 ifetWC, hVf* — 7i- 

*ffll 4 8 2£tfl 4 8 4|C*5V*T, 7*— * 
*©&#©**l£IB]«rlitJW-S 0 flii^** 14 9 8 
tt, ft%W&£%Wr%>fr, tti.?— 9*79*9 Y4 
>-9 — 7*—X&l 4 8 Qfrb<Dm : £(F>J ! —9*79* 

9 Y%s<y7T*5<Dizio\,^x&t>h%<D&wmr-h 

1 4 9 8 (4, ffl®'<X 1 5 1 5©0Jfl«f 
[0 3 9 2] 7*— 9mmU 1 5 0 0 fi, fflffl;<x 1 5 1 

5©fsijWf^iaotM$^5 t 7*-*isuw»i so 

014, STALLWtf-WAi' h>f^-7 
^-^.$151 4 8 0, ftmc, *^7>Y'^>9—7*~ 
*«J1 4 8 2&J;tf 1 4 8 4iCj|W-r-5r ttCioT, 

hZ>'<>9-7 *-7.m*. </&L<n4y9-7=.-*\tm 
oX^fa^~n > f-9*79*9 Y&%:ttft<otzm^ 
m<D4 y^-7i-7©iti)»T- 9 SrSftSl54T-t 

[0 3 9 3] ^-*ga#n 1 i4, fifjlj** 

#tf§S§l 4 9 0j&»e>©MAT_SELflH|-fc, *fFfl#fi 
Sil 4 9 4frt3<DINT_SELim-bZ$tf®ms<7. 
15 1 5<0«W«-^(C^C-C, x-^/^l 5 1 0*34 
t)!l/^^77^/H 4 7 2 frtbrtf— 9*79*9 Y 
%/<7 1 5 3 0 WB5(l**Vfc7* 
— * f4'<* 14 6 1 -fctJTJ £ft5„ 

[0394] H13 1H, Ell 2 9«M7-!!'7'nt 
4 6 2Sr<fci?^*Bfc^i- (> m&.T-97u± v -* 
1 4 6 2 14, /<4 7y>(V%m% 1 5 4 0 t , &%L<D$ 
7 to f-\rWtJt'>tl 54 5, 1 5 50, 1 5 5 
5, RXfil 5 6 0 b%G*Z a &X<D$ 7-7\*;V7° 
n-feyiH4, A^V?-7i-^l 4 60 (El 1 3 
1) \z£vXmW)£ixZ>s<X 1 5 6 5/^e>A7j£§tt& 

5 4 0 (4, 1 4 7 2 Sr&il-f 3, fW#PflHJ-U^^ 
1 4 7 0^5)©«IJ«Ift^iC4o-Cfll^$tl5. ^T©^ 

@l2 9©uyx^7r-f 
/H 4 7 2Xt>'ROMl 4 7 5 frb<DAt)Z t>/^ 1 5 
8 0Srfil6U-CSft®3^t*SfeSo ^:T(7?*7-^^ 
*/ls7u -yVb 7y4 Mflfltt t © 1^7314 ^/W- 
-?ii*,X**\ 5 7 0i/jJ<3, W^x-^^D-fes/lM 
4 6 2 COttiTj 1 4 5 5^5, 
[0 3 9 5] ^^^7-T^Ml 5 4 0}4, ±T©7> 

%^.—7 )V*hZ.b\c^X, ±X<r>jj 7—7**^7 
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a-tryiho-r— 9*7**9 hcD7n— fcftijffl-f 5„ ;< 

47yjy$m%>l 5 4 OCDflCte, U-v^y/"?^^ 

t>> /^Vyy^^JWgpi 5 4 0©/V/7^W7 

HC-efoS 0 /^yy^f l/WRWiS.'*-* 1 5 6 5a>£>V 
AL I Dig^ZgimZo '<4-?74^Wm.±5 4 0 

TV ^ ^-T 7° 7 -f >7^r- ^il^T©* 5 — f- 

ilfrb, yy^-<DHit>. B|J*>, VALIDifft ftco 
'<J7y->{ •y*7—\?\a&h* ^(DXoicLX, ^-Yt 0 

[0 3 9 6 ] #7— ^-^/1'7'n-lr yf- 1 5 4 5, 1 5 
5 0, 1 5 5 5, Rlfl 56 0IS> Kts 1 f—9*7*x. 

f— 9*7**9 hdloCft^fiSUV^. 

^7 MSftA4ocDf-^/W£f|oTV>Sfci6, #7 

[0 3 9 7] #7^^*^n-fcyf-C04»K:H:, fc»7 
•feyWD^Fggq (o p a c i t y) ^-r^Ufc^a-f 

013 1 fcte^ftT^ftV^, 3ilJ®l/<7 1 

4 7 llC««£;tiT^ai!ln©EBa**0, #9— 

^^c-fe y f-tt^aw^+^/usriEL < ± 5 
\z.fflffl'<* 1 4 7 i*»e>©«iwr*«r*ifci-* 8 zn 
fi, &SH&&a»jfN;:*5^Tf*, ^S9i7^*/Mc*r 

[03 98] 1132^ *7-^-y^7°n-fe5/f-l 
545, 1550, 1555, 1 5 6 0?r (1 132 JC 

ts^xti-fttoK i 6 o o -c* tfc) <t 9 

**7-f-tWntytl 54 5,- 1 550, 1 5 

5 5, 1 5 6 0 i4, M^n y7A1610i, feS - / 
ny^B1615t, tf 7 ^*P3II§ 1 6 2 0 t , 
•JfcTSBl 6 2 5 £, ^7^fttt7'y^-1630 
i, mtl^imi 6 3 SttrfllA-CV^*. #7*-^* 
Wntyf 1 6 0 0 fi, mmt^^'y*? 14 7 0 
*>bOS(|^ft#Sr/^ 1 6 0 2£gSLT, s<<{7 , 7<< 
M&J#Pg|$ 1 5 4 0 7)^W-f^— -5§-£/<7 16 0 4 

^1 6 0 5?rg*L.T, ^Ofttl*7-^^^/V-7 P n-fe5/ 
«>b<0f-?^i^ h£/<7 1 6 0 3£gftL 
T, A#>f v*— 7*— *1 4 6 0frt><D? ! —9*7* 

[0 3 99] fea^n y 9 A 1 6 1 0 ft, ^^1601 



16 11 (CttiTrf 3, feffi^n y 7 A 1 6 1 0 
fe^T, M7"P y ? A 1 6 1 0 lif-? tT'i'i ^ h 

l 4 5 ifrbo?— 9*7**9 Hc/FSfBSSr» 
tt, T'u^KMMtttT'u-^mTffitcop^Sriai 2 9 
cDATJ^Vy-^^-T, 1 4 6 0H©W7r^?C 
iotifflL, 1129©^7y(-V^l 4 52^P, 
CO ^ 7 V K * 7° U» #? -t -5 A> * fc 7* U- ^ K * 7 - 1 C 

7 ^ k * » > ktj 7 -x- 9 zmm 

[04 00] — «fe?gra3E*fc*S^-C, M7"a y 9 A 
1610(4, 111 1 2 9CV<7 1 4 5 1 2ocD#& 
ffi?:ffl^T4oco*7-7 ; -7*/WB^H^ffiFfl-r5„ T 

7Ai 6 1 oriy— *\?9±jv<n&\zjfmw&.*7u?k 

[04 0 1] fgTg&^ragfcKfcVvt, ^S^n y 9 A 

i 6 i ofiy~^\f9±ju<D^y-{z.^mm^y°i-m 

*u y°\sm&$thiti3 7-7*-9\cmm&m&&& 

6 1 Ott2oO>f—9*7*=.? 

[0 4 0 2] ^v ^ ■r3.T/^-T-v ^ >'^C*^V^T, «!ia7*o 

7 ^ A 1 6 1 0 ft 2 rXDf-9 Irzf'J*. 9 Y *tm? 
5. «a7*ny^Al 6 1 0 «#^<7)#^tl7*n y ^ 1 

6 4 0 t, «ia^ny7A^/V'— ni^y7 1 6 4 5 <b£r 

#^tiyny7 1 6 Aotmws^\c^xm 

[0 4 0 3] 2 00)7-9*7**9 hKMLUmg* 
n$o loco-r— 9*7**9 2o<Of- 

^^^7 h<Dm*hz>ffimy7 9 9kz£ixmmt 

[04 0 4] 2<><D^-9*7**9 hlcttLMWinZ 

7 7 1 6 4 0 CO 1/^7^(4, @13 1 CO^V 7°7-T V® 

15 4 OlCioT^^iX-S, /<7 1 6 0 47)>f>C9 
^-7*7Wf <t o T-Y ^— 7*A- $tV-5^i»7 7^-^ 
^^^X5„ ^a^ci y7 A9/V— u*y9 1 6 4 5li/-< 
716 0 lfrb<Dy*—9*7 f J>x.9 h*5i0?/^7 1 6 0 
3^bCD7 f -^^7'^i7 ht, V><O^C0^atg7*D 
77 1 6 4 0CDHWj££gtt®<3, C^t)Sr^C0tecD3l 
n^fltz^mm7tiy9 1 6 4 0coA73i-ii5„ 
n y9 A9/1— vi/y9 1 6 4 5 t>/-?7 1 6 0 2?5^5jCO 



(04 05] Wm-fn y V B 1 6 1 5 J4, 1 6 0 1 

i^©f-?t/v'i^ ht^^l 6 1 1 a>fe.<Dg&##j 

ffv\ «S»«K:W-*S^fcf*-^*^*^ l>«v«*l 

6 16 izmtrfZ. M/o y y B 1 6 1 5 flSBfc&a 

h/<* i 4 5 la^^r/waa^/cr 

"^vKfcjtfLT, 1 6 0 3 

tt-Sirt'bfc, 8. 87*—?y h«2 5 5/^SflS 

[0 4 0 6] IEcD^U— ^Srfco-g-j&tc&^T, fea 
/cy^B 1 6 1 5 {4, 7'^toa^ttfc2ocDx-^^- 

(4, SufEcDfq;$>f>2 5 5£§|^T, ^r^Sr^yfeyh 

t> M/n j/^B 1 6 1 5 14 5 1 frh<D2 

U aoT^S^MfttfSrffl^Ttea^ny^Al 6 1 0 

[04 0 7] 77-( yl^fe' «fc tJ*&#* JUilfcV * 
t, 9BM7o y^B16 1BS, 3Rfcyffi*<Z>£»:8lS$: 

ants. ®&&&m&mcts^x. toa^n ^^bi6 

1 5i4V-*f ^*A'©*9— ICFaW«tr^HRj| 

[04 08] fea^n ^B1615tt, #^W^mH 
ycy^i, Saa^nyy- B^/V— c^yy 1 6 5 0 b 
#H^«^ii, »a^oy^Al 6 1 0 

wtwiP«-cfc5*5, iaa^o y^ Bi//i— xt*?*>t 

1 6 5 0{C:fo^-C(4, ^*1 6 0 1, 1 6 0 3, 1 6 1 

i, 1 6 3 \frh<»7 ! -9'*y'v^t Yb. mfcztut 

1t&KB£fv V 9 <DAtl\cmZ><, ^M7n y y B >f>\>- 
u?Jy? l 6 5 0!)^ 1 6 0 2d»b©»JfPfBf-fcJ;o 

[0 4 0 9] t^ai«1 6 2 0li 1 W&7*By?A 
1 6 1 ObUm-fuytB 16 1 5 fab 
^<o*»Srijg-g-f 5o dttl4, /<*1 6 0 1 Srgft LT 
XM>9—7*—x l 6 4 0/5><b> ^16 11$-I 
ftLtM/ny^Al 6 1 O^P), 1 6 1 6 
SUaa^ny^Bl 6 1 5frb, ttt, ^1 6 
0 5%&&LXl'*JX?7T'(/l'l 4 7 2frt>, 
tKOAA&gft&tK /<* 1 6 2 1 fcife^Sftfclt** 
W^-fSo tTy ^JmJItg 1 6 20t), 1 6 0 2 COM 



[0410] try y*an»§§ 1 6 2 0 f4. «* ftBttfia 
■MftCttoT, JMeSflMHJil-S 5. fy^ 

JngSl 6 2 O©0fSOBfW!HlftffK:*J»t*»^S:« 

t*y i/flngLSIl 6 2 0 l4#ka7*o y y B 1 6 1 5^6© 

[0411] iev^u—? zft^-g-mzte^x, -fry 

•fey M^-z/^j&sfilbSjftT^SiSfc, fyy-ftg 
32 1 6 2 0{4^SW*f-^^/^6)3j-7-fey K0fc§5fe 

4k«*/«3i*«l|IK:*SV>T» try ^4Pit» 1 6 2 0 (4 
Aaa^nyyB 1 6 1 5 
[0 4 12] fltig&SmXiU&fiSV^ flW^/Ut 

tr y ynmstt 2 ocotfw^usc/^- ^^ina 

%2^^i9)^X. MM^9;\^ 

Sr»*.5. (Aft) S&l 6 2 5 {4, /^l 

6 2 lSrgft LTtfyy*AP3l«l 6 2 0 A^&S 

mtxoitim&wMrcz. a»»«:*-t-try 

b<om-±. ^?7r^;H 4 7 2H^1605 
©B PffitiaoTSf^S. B P(t-^*«PR-t-*tt 

^SrWTo^t*^. «9»re>nfcw*tt^ 1 6 2 

[0413] #gcx-y/W 

[0 4 14] 

1*27] 



£27: 75^a>f-7';l' 



bp field 


Meaning 


0 


Bottom 26 bits are fractions. 


1 


Bottom 24 bits are fractions. 


2 


Bottom 22 bits are fractions. 


3 


Bottom 20 bits are fractions. 


4 


Bottom 18 bits are fractions. 


5 


Bottom 16 bits are fractions. 


6 


Bottom 14 bits sic fractions. 


7 


Bottom 12 bits are fractions. 



[0 4 15] ftmwmxUl 6 2 5tt, 
XV>!}kft-K2l<Di¥m%fto a 

1) m&xbfiit&m^xhzfrb'5fr*»:i£t 

2) $0fcT&ftfc8*<Dtt*ftt#2 5 5 J:9**l^a» 
* 9 y^Xfiy 1 6 3 0 1 6 2 6 L 

[0 4 16] WOlfCfcnfciBt^olftW*****^* 

Ml 6 3 5(4, /^16 1 6 0ftl7*ny^BOfflAi 



1 6 3 l<D9yl<'-7$.tcftyys<-<Dmj3b<Dtl:fr 

[0 4 17] ^JE<D^U— 9&i>^, TV^g&Lco 
£7£|C*5VT, &MitM 16 3 5 (4&Sl7*n y ? B 1 6 

7°*fcf47 7'<-l 6 3 0<7>ffi;ft&fflii$i£S„ 
[0 4 18] ]E©^W— *&t>o-&j£fc*5V<-C, #S 
fcftl 6 3 Sti^a^n y? B 1 6 3 0 CD^< odMTJlU 

^P B 1^(-fc^T, £»fc§i5 1 6 3 5 (4^7-* * 

teOftfftCfcVT, #ltftg|5 1 6 3 5 14, 7 7>-:/Xf4 
7 y/*- 1 6 3 0 <Z>|itfj£i§ifi$tt5. 
[04 1 9] Ell 33(4, 03;U4l 6 4 0W<t5^ 1 

1 6 4 0J4, KtftWSBl 7 1 0 i, 2o©jBit'< 

9^K»asci 6 6 Ofttfl 6 7 o t, 3o©#na 

3SB1 6 8 0, 1 6 8 5, &IM 6 9 Ok, 2A*Jn* 
351 6 7 5 2OC0»*^o2A^mmgBl 6 95 
<h, ^^ 1 7 05t*fi5„ 
[04 2 0] KtfcWSB 1710li, 0129 CDi&JfP 
ff-§-U^7y- 1 4 7 0*>P><£>MODE{f-5§-l 7 1 1 £, 
012 9©A^V^"7i-^ 1 4 6 0frb<D2o<D 
SUB{f^-l 7 1 2^t5SWAP{f-^l 7 1 3 i&gtt 
®5 0 7 1 0(4, ^tlb<Dm%%'lg.%L 

T, in**"*? y Kltfatft 1 6 6 0 *54t/ 1 6 7 0 £ , 
#IftHlil 6 8 0, 1 6 8 5, *34t/l 6 9 OlCfc 

(4, #mtl:/p y 9 1 6 4 0 £91* fc»fls©-e# 54 5 
teflMS-f-S. 1 6 4 014, 

[0 4 2 1] 1) K : SUBltfl 7 1 2K 

A^3 1 6 5 5£A*ll 6 6 5(dJP;i-5/$\ *fc 
{4, A^/1 6 6 5^6§Ko JEKU SWAPM6 93 

2) /^/^^~ K : A^3 1 6 5 5 fcfflAlC^'f 

So 

3) *HB]*-K: A^l 6 7 5&ffim7r99 k it, 
A* 1 6 5 5 t 1 6 6 5©Sg«rit5o SWAPff 
1 7 1 3(C$V\ A^ll 6 5 5*54.1/1 6 6 y 

[0 4 2 2] 4) ^HR»*-K: A# 1 6 5 5ldA2l 
1 6 7 5&jttt, *(0>£*£2 5 5 "CMS. INC^ 
7^ 1 70 8©W^{4s 3ELVSft*feWSfc*fc'<* 1 
7 0 7K*lt5, i©^y-S?©IS*t*llrt _ '<* 



5) K : A^J 1 6 5 5 KA# 1 6 7 5 fc»lt 

[042 3] 6 ) HP«|M3 i^HR**- K : A^J 1 
6 6 5*A^)1 6 5 5(CflPx:5;&\ *fc|±, A^3 1 6 5 
5*»e>5l*, tOi*KW]1 6 75m, *uc, 
C©^Sr2 5 5-CfiJ5o I NCI/-^? 1 7 0 8<Dtiit) 
14, iELV^;££r#5fcJ6(.v<;*. 1 7 0 7{c&5;ioo* 

[04 2 4] *Pf[^7 ^ KttSffi 1 6 6 0M167 
0(4, aDH«^«to-C«aLtt?f 5«t5^1-5feftii, 

6 7 5(4, /^ 1 6 6 2tl 6 7 2 <0*P#^9 ^ K» 
S?y* 16 6 0&1/1 6 7 0 <Dffl7j£r-g#:u *:cr>fp£ 
1 6 7 7^^*1-50 #fi{fcn^y7 1 6 8 0, 1 

6 8 5, fttM 6 9 0(4, JJraoaMB&Stfi'l-SfcfcK: 

1 7 1 0frb<D/<X 17 1 4(0$lJ#m-^lC4o-C«fi)c$ 

[0 4 2 5] 2 o<Djq$:£#oSftggl$ 1 6 9 5 (4, /<* 
1 6 8 2 3&»&©A#«r'<;* 1 6 7 7 ^(bcDA^KSHt 
•So *Lt\ iJiaWlC/^ 1 6 8 7*341/1 6 9 2i»6 
OA^Wft^P^So *P»^1 7 00(4, jgffgBl 6 9 
5<£>ffl7J©T&8 Ify htd^ai 6 9 5©ti|7J<£>±#: 
8 If y hSrJPA5. JPJHS1 7 0 0(D^±lfi4 I NC I- 
7 0 lH5y^£*v5. INC^^170 
1(4, ffi-f-1 7 0 2\Z.£-oX4*-7°^£tlZ>o f 

9no 5\*mnm 6 9 5^p><7)«srfEtsi-2)o m 

t>ft^-l 7 0 2(c4oT-i'*-:7Vv£H5„ 

[0426] 01 34(4, &tfm*fe<r>y'vy9m*7F 

1) %%\£9^f^f—9 : rO^^gp^-r/K-iS^T, 

2) ^jW-^yK: *9-t^SMad>P>*5. *9 

* fc (4* 4 JVXh 5 1 1 555 -C f 5 0 
[0 4 2 7] 3) MM : Kf*-^Sr««i-*. 
MSI*??? h^tfy h-Ty/SfeU/^ hvy/ffc 

\£9 ^)Vf~9 (4ASW(c 4 o©ft^e>Ji:5. 

<D=f- J c*MtK°9±/l>(DJFMW8[-?hZ>o \?9-*A'7 ! — 

[0428] mm^mmm-cmn^ii^^^mi 
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(a c 0, a 0) \*yumW£ft1t\??±;l'%7-- a c [0 4 2 9] 01 3 4 fcjSttS-frjfc/n <y * 17 6 0 

i^S^ga 0£, r \t "*7±y h" fit wc () \* ft, ^^(O^y—^^uy^h^m^^^^v^^ 

over, in, out, a t o p (D&^U— 9 <£>i£ <D 1 <><D% 7—^^>*;\st^&W^V>*Mcttl,X 
*^ls-*bnm£ftX^Z>o &tz, &J$*y*M*£iW »f£LT, tUJjt't-k/W* 7~ZnZ>o £Jt±<0fttf;£ 

PIXEL Composite ( 

IN color A, colorBrPIXEL; 

IN opacity A, opa c i tyB : P IXEL ; 

I N c omp_o p : COMPOS I TE OP E RATOR) 

( 

PIXEL result; 

IF comp op^rover, rin, rout, r a topTfcS 

t THEN 

colorAtcolor &%*7 % ; 

opacityA, opac i tyBSr77y^t5; 
END I F ; 

IF comp__op# s over, rover, loado, Xte, p 1 u 
stfcStTHEN 
X=l ; 

ELSE IF c o mp_o p#in, rin, atop, rat 

0 p-CfeSirTHEN 

X=opac i tyB ; 
ELSE IF comp_op*out, rout, Xte, xofCfeS 
£ THE N 

X = not (opac i tyB) ; 
ELSE IF comp_opj5loadzero, loadc, X\t, 

1 o a d c otfcSiTHEN 

X=0 ; 
END I F ; 

IF comp_op#*over, rover, atop, ratop, X 
tt, x o rt*5tTHEN 

Y = n ot (opacitya) ; 

ELSE IF comp_op#plus, loadc, X(t load 
c o ~Cfc5> t THEN 

Y = not (opacitya) ; 

ELSE IF comp_opjjSplus, loadc, Xt±^ load 
c otfcStTHEN 
Y=l ; 

ELSE IF comp__oprt*in, rin, out, rout, lo 
a d z e r o, Xte, loado THEN 
Y=0 ; • 
END I F ; 

result=coloA * X+colorB * Y; 
RETURN result; 
load' t' loado'««ft^ xlv9 1 7 6 5tf? 7 y^SX ?\mtfZ£ti6 t, fF 

[0 4 3 0] 01 3 4^*5(t5yny^ 1 7 6 5 fi, :/ ^ny? 1 7 6 5 7 y <t 0 fcfltefcSft* £ , 
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[04 3 1] ( (x — m in) mod (max — m i 
n) ) +m i n, 

w^t, mi n irma x 7— (C&I^TfF^ftSfllr 
'NStfcfcfifcitfcTS. Mcmt&kmt LTI4, 0 
it2 5 5#S£LV\, @ 1 3 4\Z&tt%zfny>7 17 7 
Ott, 7"d^ 1 7 6 5 £ 

fcj;5tr^-fcA't^mwi-». rr-e, o 

OROMA^#f)}l5. ROMrt(Dffi(48. 8 7*-? 
JRffottJKtt l 6 . 8 7*>— 7? h-e*tt£*i3. «:/ 

[04 3 2] 7*7 V P&Om 1721 J2^/£CD|»&flIi: 

TcD2otDX7"-i?{r^oT : fT^^5 0 
1) 7^7°£/& 



2) *§M 



£££#2 5 5 "^fyr K©2o#*> 

[04 3 3] ^■rfTVe—HKiiavvc, ^xj";ytt 
^7s/7'{i2 5 5/ (*$- 1) £;fc&3fc*tC8. 8 
7t-v;/h©ROM?rffl^5 ( CWfilt 1 6 \f y Y% 

VS^ttB r e s e n h a mV&t&T/l'^y XMZ.U1t 
T/i^UXA&ffll^S. ^®7;^i) XA££tTK* 
1-. 
[0 4 3 4] 



Void line draw (length: INTERGER) 
{ 

d = 5 1 1 — length; 
incrE=510; 

inc rNE = 5 1 2- 2*1 ength ; 
r a m p — 0 ; 

for (i=0; i (length; i + + ) 

{ 

i f d (=0 t h e n 
d += i n c r E ; 
e 1 s e { 

d+=incrNE; 
r a m p + + ; 



} 



} 



} 



[0 4 3 5] 



Blend= ( (end-start) x ramp/255) + sta 



r t 



2 5 5i££Z>mmz*tLX®<i)&X&ft*>tl?>a 

(e n d-s t a r t[C*fb) "TV**" *fT* 3 7* 
nytb&Mgk-i-Z. if*-^/<^»2 4 2#ff45 

-mt&£m®& (gcsc) \mt>*7-m*$>z> 

[0 4 36] \^<ofr<Di§&l£3S^Xtt:, GMO^yi? 



>T£tt5 0 
[0 4 3 7] 

0 1 f x£6 3 

out = 2(x-64) If (64^x^191) 
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ZxZ-^yyjyZ 5 ft & tc <DWe&tf£&V 
<fo<5 0 

[04 38] 1) Ell 3 5JC^-r<t 5ftBttK*©** 

2) y-^H«t©i65.»»»*»»i-6. 

3) BWB»©*¥, fiifi-yy-y-^yy M«r 

[0439] 4) ±efflf««r*aa»K:fiafeu, ex*?- 

■y-yy-^yy v^, mm, biH^^©^^**? 
Blii-rsSBih «5^tf-y*f-vy°y ^flRScfcif© 

[0 4 4 0] Hi 3 6ttBfttf*-fc*HK<&tt*fc:fc^T 
* BH&© t? * ir/utf^ij/fl Wr?*> * i © £ «j£ L T 

fc io T3I § Hi SftSWfeggJki: v^y©yn y * @£ 
1137 (C^-fo BH£^&^>'v 5 > 1 8 3 0 1ST KU"* 
£$£3 18 3 1, 1 8 3 2, ffilBJSJ 1 8 3 

3, 11*1 8 3 4, mmx, mMm&xi?> 

SSfe3SgP 1 8 3 5*»P>/£5o 

[04 4 1] T KU*£f£gBl 8 3 1 te, ^*t'^-fe/U 

5o Sfc, ClMA^yfy^^f-^/H 8 1 5t 
K&i 8 1 0(D\fi'±;vt^b4 "yfv9 7sity±v b% 
**>&1tlb<DT K wo* T Kw*£j*«l 8 

-<y? !>y*«r«M>. %—*a>?4 *y y /^©7t~ 

v y h left 2 ooffilSdJfc •? , **vSrH l 3 8 

ij—*fls?4X9 y y?ti\ 

l) y-yii&©M&ffi^ («F*ftLOH^fcS, 2 
4. 2 4ffi&) . {£« (0, 0) teM£©£±«T-fc 

[0442] 2) 7k¥, Sitco-y-y-y-yy/^/^ (2 

CD», 2 4. 2 4»K) 

3) ®fcmm&mmz.&ttz> 2jt/h»^©ttjt^s% 

■f 3 b©b p7^-/VK 4 01 5 0tlbp7-f-;v 

4) JRJMfJiJfcft, ^fttt2 0<@©2j§&» (2©}| 
ft) £#0" pTIE" /J*jS»«©t«>-Cib9, 2it/J^ 
£©#Bfi b p y 4 -a, KK J; *> PffJRWKi«S $ *T<$„ 
[0 4 4 3] 5) iJ—^/Vf^ 7.^ y zf?<DV- Kcoa 



^y^x^y^ y y^irfcv^, x ©P&j^fii© 
£&§B£I&V ^cftfe©^ 7 y tt&© £ 5 *ffi«r«Fo. 
x©M*6ffi«<o»» < -0, 
y©BMflffi* < -0, 
*¥f/i'? < -1.0, 
ilf/V? < -1.0. 

7 KW*£j«U 18 3 1 asfl&fcSftfcfc, #M£ff|f 
rjicttt7*t >?ji<ftmco&5tlci& LX 2 o© 

-y-y-y-yy/^ijo^Tc^ 1 x ix-hzm 
-a-, t K^^aui 8 3 itt+&t£mmi>wbti&*. 

[0444] -y-y-y- v^u^ijo^tc^ 1 x 1 x-t^m 
•a-, t" k^*j*» i83i (•tftPioyioiDmmzt. 

Xfc¥-7 ! /i'?&mBkW:fcl)HZ.Z 0 *:©&, 7K^M 
S183 ll4*©ffOfiE*Sr**5fc«>K*«f*/u^Sr 
gUfiWlilO*.*. 7* K^ffejfttt 18 3 1 l4ftoM#5r 

^SriK. ^©^ T K^^AAl 8 3 1 ttHitr 
SrSUSSKcan*., ^tTrwiiS^HOig-fc 01 

5 o©±ig^*3(tsy-i'ry7A(4fi : 5ij--©Ty-fey7j 

fe&*1- 0 ^©flfitSrffl^T, ff?iJI4^yify-c?ydr^ 

iii*-^f-f^? y y^(c*jv>tp]«*)(gfp-cM'< 

[0 44 5] &mm*&tiL Lfcft, T K^irftW 1 8 
3 1 I^yfy^7f-7;w7 Ku-^?r**.5fc*, 

(y-^lfy-fe^sffi^$^TV^^, 
§151 8 3 1 HtftW'fyfy^^f- yA"t>*»*i^KaS 
o ^ y 9 7*=T—~7 >V-><— 7,7 Vuxtt ( y + 

0) (djstts-r^xyy^x-y/uaivhy srjti-o -f 
fc^, r K^ft*w 1831 teztizxmmzmz. 

TK^tiJBll 8 3 ltixSI^-fyfy^^t 
[044 6] Hife^m©ffi^€r^«) 5 i t , Sii^M* 

»f4, &&frmm*&<nmt>K'?±Mc$>\-tzftm<Dm 
^ftumivx^zzbxhz. mfo£mc*st<^x, & 

ttzttffiiiX^Zo 

[0 4 4 7] 01 3 9(-t5V>-C, ^Wcn?-! 7 ?7 M% 
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»Srffl^T#B8f5. *©fL MPflSBl 8 3 3 
SrffiPfl-fS,, 

[04 48] ffUffi 1 8 3 4(12 oOf^ttt 5. 

i) ftmmt\£if*fl'*&ttz 9 

y-^fctUA-r*. JSIgSlSl 8 3 4©«J»Mtt v + * 
/HCftCT. 0t>L<tt<»JEOfl[t£|Dffl'fl:$*VS. 
[0449] r/n y * 1 8 3 5 fi^ggfl 1 8 3 4 ©Utf} 

o-Ufc|t***lt«fcH*/J4Hcl|i!«i-5. * u-c, 

©tH^(C*3^T2iS/MR^CDfi@«*-^/P7 J ^^ J ? II 

Ktt, II^Cfc^tiTS^t tfy h©&£^ 
Sft-Cl^*. CO*#i[B:«F.-§-*>9©2«)*|»i:UTfik 
[0 4 5 0] ±f-?/^S8 2 4 2^fT^.5t 5 1 O© 

BfM&SKftsttfT ^JUgTfc 5„ fif JOlRJItt 2 oco^PsI 

[04 5 1] 
IIS7] 

























*>.:, 




k< 






b i.o 








h< 






b 3.a 


hi 


*3. 2 




h< 



[04 5 2 



i l 



255 



a l 

tf-C*»ttltf*?>*V\ 014011 £7*— *^*8B2 
n © f KfS If y -fc >*Mz.ft$m%t& 

ant****, *©«*fc**+ajn*«, asicac 

[0 4 5 3] *T^JR*mT+afc»fctt2y'nyy-tf- 

u j8iiLai:iniiLao"7*-^iJjEu<aR$nai5fc 



8 5 1, 1 8 5 2K±oTgW?$ixa. 
&U:^©#^£fT^j©£{»ll£:fctf5 2o©?ij x apt,. * 

[0 4 5 4] S5 1 iNf */WC*5Vvt\ tr^-fe/KOj:0± 

Oft*tt**f-^ ?;WWmcm*Lt>tlZ 1 8 54, JP 
JWBJkl*rt*flJB:8 tfy hfcSJ«J&T&*l.5 1 8 5 5. 

V KHa«|J" 1 8 5 611 fing-U 1 8 5 4 CO A 
*^4o{c<ci«t5tc**|a5ffl^SrffBE^J-r5o 

■ffk\ 2 4 If y 8 (fy h fcf^-fe^^tWjEL 

[04 5 5] "A CS&agT 1 8 5 5 tt*PggB©tti?j© 
^Tftl 2 If s» h*SJ*T, Rj£fc«VMS9»T&*l,fc 

*ti-5^fctt5ym. "AC»BS|5" 

yy-tz,*. 5icaa:S$nfct*, o«T©£T©ffiiio 

K % 2 5 5 «±©£:T©ffitt 2 5 5 (Cfflx. bftS,, "A 
C^a§B" # 9 y 3 fcR££ftfcfc#, 

#©T&8 \fy ItmtiistiZo 
[0 4 5 6] ±7~-fy<7n 2 4 2 fi, _hffi£A*©Eftl 

feaSrfT* 0 £.0 icRS^ftS i t t.Tf 5. Rf+IWiJ 
a»^%-f- < frft 5 - tcoft 5 = y If a— ^ T-^rX 

tfa-^T-^X^^ + filfelSctt^fcoTV^fc*!). ^i- 

a»fp* t 3Wt T? * 5 <t 5 => v bTa «r«^5 c 

[0 4 5 7] 3. 18. 6 f-fZ-Wi/apmnt 

7—f*\yi'3.W®U2 4 Ofi, 3yot5-f2 2 4 
K*5»t54^n/^ h(D^ttiL7 f -^^-ts/iy 3 .2 3 
0 5r1ix.TV'>2>o 9 s — ^^ftyi'a 2 3 0\$?<{ h 

•>a^^!) 2 3 0 (0 2) JC*J»tSIBID*$©iaiD9 
^ * y It S C <D 7 4 V & fHft dr ^ 5/ ~> a 7 •< V t flf 

[0 4 5 8] 7— 9*-*-y~>*.ffl®il$2 4 0li2o(D^- 

^yy K*- 4 7, 2 4 8^f,©x-?^5jt 
^f— tf*-T£. Sff-^Mt-r>a2 3 0l;#ft 

^7iyf^5„ 9*— ^ ,! *-ry>at!|«lS52 4 0{i 
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l47*By*? A<OT# Z>TYisx£.fm&h*), r-** 

3 0 K-CI48 7- K (2 5 6 tTy h) £-C07 

-y-^^U'-v'a V^-tf-TM'lF 2 4 7, 2 4 8l:|| 

[0 4 5 9] yv^RAMI4 8oro343£l,-CT KU 
*<^#^y^^:7;6>e>#<5 (M5 7^y7K^ 

&2 5 6 tfy MCJ|Mltft»tfe*l.*-»©W»T KUfV 

[0 4 6 0] ^s-'^ll 5 fiJlT©*- 

5. 1. - r/we— K 

2. *-a*-«fesiBae»*- k 

3. ^W^-jRftffiWaElft*- K 

4. J P E Gff-SHfc/t— K 

5. ffij* J PEGI^-K 

e. fi?g***~K 

7. ^3C— tVw*— K 

8. «tSWb*-K 

014 111 H2lCi8tt57*-**iry^»*WPlS2 4 
OcorKi'.x, r— y\ Mi^n-tf-^+ty^ 
2 30££^-tv 

[0 4 6 1] T-*3-¥yv'^2 3 0i4, fUMLfcyV 
u-y h-ry^i-yi/sLSrftitS. f-#*ty^a 
MMI2 4 0t±, 'S-^-Ys/v/i^^yJCioltS^^y 

h})&Gi-z>*y**y 1 8 7 2«r#wib-c*5*>, 

h y yS^a 7^>-^§l£^y7*£*VC^3fl- 

y y-f ^*J#»-C***>if 5 *»**f ^#Sb 
#fS^*y 1 8 7 3<t>(i;i3 0 4WtS'^7'f>' 

[0 4 6 2] *f—9*-* y*/*.W®%2 4 011 ^7 
VKt-^t-ffC2 4 7 (El 2) tt^yKt-^f 
7V1PC2 4 8 (H2) I)^©7-^S*^7VK 

/<^-f 7*— .xfcat-ciflWMcih- tr*-?# 5. 

»fp|r*JV^, ^7^^^ If 2 4 7, 2 4 8 
(02) <Db*%lbfr<D-j]hl,<nmi]i*4>Tytx 
1 8 7 4fc«{fcU 7*-*R#flH§-l 8 7 6£tfj-fo T 
KU^fejWH 8 8 m^y^y^^l 8 7 4CMU 
loR< tt^«±<0^±*^eiJT K^1 8 7 7* 
fe£-f So df-fy^affllWai 8 7 814. £/jfc$*lfcT 
KU"* 1 8 7 7©^^7 \*UXfo%rf 1 8 

7 2£HSE-i-a* fct>fc» Mil-fS^y^?^^ 



y l 8 7 S^St-SCirtcj; 9, 5*£;h,fc7*-*a s 
.+t>*/a2 8 0{c#«-t-5*^5^Sr!HI»fi-S. ®* 

$)ft,7t7*— y-#**yi'a;rf-*y 2 3 O^frfc-fS t 
t, 1*7-^ 1 88 0^*11, 7^^r^Vh 
0S#) ft#l 8 7 9^iaJI-t-5^U— i 

2 4 7, 2 4 8{Cj@ie>tt5o S#£tl1t ! f—$'& 
df-ty^a^*'; 2 3 0i;i#£L&^££\ A7J/^^ 
^-7^-7 18 7 1 t y?-7x-^-f y 

f2 5 2 (H2) £iICT\ RjfcSftfcf*-* 1 8 7 0 

tf*fflbt*ya>&7 7-^ 1 8 7 01*^ 

#{f^l 8 8 2£ffl;>jU RjfcSJxfcT*-* 1 8 7 0^ 
£j*3ih,fcT K^1 8 7 7*t§tt-f5r tlCj;ot7 
i?fJJv5. Ty/ y yS*flWl 8 8 3&U<il#$ft 
fc-r-^ 1 8 7 Ott^r^tt^^ yS/a||#P»l 8 7 8 
JU** + y*/>^*J 2 3 0|£j£6>;fi5 0 * 

©H^^a^*!) 2 3 0(CHjg1-5*^y^3.7^ V 
a**Uf*?*-* 1 8 7 0fciot3!«SivS. frU>* 
tr>a7'OW^'7 Kt^fc^yV-^y 1 8 7 2f£ 

%>&mi 8 7 3 3jSfi»*il*. T7/ y y^ff^l 8 7 
9 147-* l 8 7 o 1 1 1> tcBB«-f5*^s? yKt-^f 
7--Ylf 2 4 7^tt2 4 8 (02) l£j£b*v5„ 

[0 4 6 3] 014 2J£*5^-C, f-^tr^ 2 3 
0Wp«^-y«^Sr^1- o f^Hy>a2 3 0ll > * 
■fry i/zLyj^&frS 2VfoZ) 1 2 8 iBcO^-V y i/a. 7 
-f^CO, .... C 1 2 7Srfco^-r u-y Ky/Jf 
-fry fat t-CSa$n5 0 3-^y^RAMt4SiJ*<73 
TK^tSOtfS^^y^^BO, . . . , B7 
Sr^-«b-C*5 9, #^^ey^>-^(43 2 t*y h£>/<y? 
7^yi 2 8<@<7?Sr}f #*-lry^3.7-ryC i 148 
o^^^D/^BO, . . . B 7lC*5VvaSS-f 5 8 
O(0^7-fyB0i B 7 i «:tt5, 

[0 4 6 4] £j*$Jxfc*«;**yT K^<0«j«SrBI 
14 3|C/ft. *J^$HfcT KU-^142 OfcTy h^^T 
KW^., 7 t*y hy^yr Kl/X, 3tfyh/<y^7K 
2 fc*y K^*»fe*<5 3 2 t*y h CO !7 

-K-C*>*. 2 0 tfy bi?^T K^tt^^T 
^yV^ey l 8 7 2^cfEti$^^TV^2,^5'^^:H:|{51-5<0 
7 tfy hy^ VT Ku^^dr+y^aLy^e 
y 1 8 7 0(C$>5^iS-TS*^yiy3.y-l'V(DTKU'X 
icffctvS. 3 t*y hAy^TK^tt+ty^^^ 

y i 8 7 ooyffig-r s^-y^yycDT Kw-^ic^^n 

5„ 2 If y h^-f YT KU7.J43 2lf y h/<yy 7-Y>- 

[0 4 6 5] I144H:, f-^+r> 1 BW2 4 
Qb7— 9**yi'a.2 3 0rofl|it(737*n y y@Sr^ 
f 0 ::T, 1 2 8X2 5 6 tVhRAM(1^5'v/a 
y-*y 2 3 0£1f$U wtVf48oO 1 2 8 X 3 2 t*y 
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RAMtt»t&3MTfB#-h (write), 
T KU*#- h (writ e_a ddr) , St&^y* 
— ftf— b (writ e_d a t a ) £rj=?K> 0 
*»TflB#-h (read) , 8oO>*7K^#-h 
(r e a d_a ddr), 8o©f^f- ^tti^JJK- h 
(r e a d_d a t a) Srj3f0 0 ^ y 9 ysL ^ ^ y 2 3 
0 <D&X<D/ * !) /< ~.<0|*H$#t ii^Sr"5Iffi{c 
Sfcft^yS'aiftlffr/ny V 1 8 7 8 f &*»T 

2 3 0liff^f-^#-h (writ e_da t 
a) £iICTfl-gM*!Ja»e><Dl fcL< fi^fLgJLh©? 

(writ e_a ddr) (Cy^ V7 KU^SrHWU 

8 : l#a{b»MUXtt9*- 
yS'a.fRMMlS (addr_select) ©fg|#p 

^■^ 1 8 7 8d>&ROTttflr*tf£j*&ftS. 
^ffi-e, ^r^^*!) 2 3 0©^*!)/^©8o 
©t^7K^^-h (read_addr) 

h (r e a d_d a t a) ^f ) 8o©I/ l i5/'!y^ 

[0 4 6 6] y>a^*!) 2 3 OtfV^^B: 

^p^^ApTIBT KU^&fiWBl 8 8 l*«Po-0*$. 
wfK4Ji5 8 oro{4«-^c7\ 8 -o<o^^e !)/■<>• 

^*»f>WP^T^-fe^Sr-5r^lc-t-5. #-<f©7Ki/^4 
1 8 8 1 \%T YUX&rfm 18 8 1 <&«s»*- MR 
%.<Dtzib<Dd c c^e— KAA, 4 1/7?* hA 

y^mr k^4*»i 8 8 i ©ft**- Ktt, 

(a) d c c^e- KA#~ (D{f^#>!rC7)T K^.*±J& 
»1 8 8 l^^AT^ir^^-KtcU fl-SM^-y 

t>L<tt*;h4st±©7KU';*4j**l 8 8 1W7KI/7 

(b) d c c^e- YAti^<r>im%ft&*<DT Ki/*£f& 
Si 8 8 1 *m®t£*- K!C1"5 J PEGiya-f^ 

fti, #*©7 KU*£j**l 8 8 llCfiKyxy** 
/<*-y HA^w^^y^^^A^Sn, -fvf^ 
*7 Kw^fr4l*r<5. fW^-Kiaot, 7KW 

[04 6 7] 8 0107 K^4J*»1 8 8 lf480W^ 
*S»3llsI»*»?)<ifoTi39, «AA i: LT-^-* 
7K^, WAtUW*y7K^*»odcc 



^1 885Wyfy^ y h ©j|&£-£T-fc.55i£ 
<V<~ XT KU-^§rf5iSL, d c c^e— 1 8 

8 8 \Vf— Z^r^y's^ 24 0 OOm&O^ft* - 

K (dcct-K) SrBtt-fa. 

[0 4 68] ^^^ ; &yi 8 7 2(Jl^Dj ( ^ 1 128 
X 2 Olf-y h»-r/U^JK— hRAM-Cffij$$tt5„ 
RAMJ4lo<E>#i§:#- h (update-1 ine- 
a d d r) , lo<D^$ irt#- h (write) . 8 

cxdS^tK— Mtag 0_d a t a tag 

7_data) Sr^foTVSo ;jtll 8o«7K^ 
8 8 l*igifi|Blt$tfCV><5, lotL<l« 

Kf^%^f5rt«rj;>)^-h (r e a dO 1 in 
e — addr, .'. . , read71ine — add 
r) ^c^^^^T8o^)|^B#co/^2/^?Ts/7 e Srp^lelc1-5 0 
XV^-fc?)? ?T VUXlttf— b (tagO 
-data, . . . , tag7-data) fab? fit 
mil 8 8 6}Cffi^$tlS„ tf-h (update-! 
i n e - a d d r) ©^^^^e"; 1 8 7 2 

1 8 7 2££»)*^»§{t*{4£/££;ft-5.- 
[0469] 128 t* y h©7-r^v a 1 id^^Dl 
8 7 311 +t5'->a^ ; e!)2 3 0©^t7V'i7'l' 
y<0va 1 i <Hft»«r«oTV*«. Clfttel otf>##*K 
— h (update— 1 ine — addr) „ 1 0(D# 
toT^'K— 1- (update) , 8o(73gg^^./Jf— h 
(readOl ine — addr, . . . , read? 
1 ine-addr) , 8 OCDgfc^ pTtStf- Klin 
evalidO, 1 ineval id7) 

45i2 8xitf y hc^^y-efcs, pyt^vtm 

C<fc5£, ^(48o07K^4)«|18 8 1K, 1 

tl^7-ryva 1 i d^li$r^!£$-&Sii:(cJ;tJ, 
sK~- h (readOl ine— addr, . . . , re 
ad71 ine-addr) \ZM LT<0 8 o©^y 
^Ty^nTiBlC-f-*. :©7^0S7^>va 1 i 

d e fcTy h fi/K— h (1 ineval idO 

1 i n e v a 1 i d 7) fi>b? VVeWfr 1 8 8 6lcffl^ 
£*t3„ £<^Cj;o-Ctt, 7-fVva 1 i d^y^D 
1 8 7 3<&*t /-K-KC, 3jf — h (update-1 i 
ne-addr) H5>fVva 1 i d^H^^l) 1 8 

^»fW»^oy^ 1 8 7 8d»64*i-<5. 
[0 4 7 0] ZPttMUl 8 8 6«8OC0^^Jt|!5§g^^ 

b*l"Mz£oX7?-tex£ivZy-{y<D?y**}) l 8 

ft©t a g_d a t a A*, £ ftfcftgM *r !) 

7 YV7.<r>9>fT Fl'X§:WWl?>tclb<D t a g_a d d 
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R^-Kff-S- (dcc_mode) *^(t®t5fcfeO 
dcc_input, %&£M£1ntzM-mT K^©7 

>vali dtt«**y 1 8 7 3fc:£iSE-fc-:/$;fVCV* 
57^fyv a 1 i d^ffi5rSttIfe?)fc*co 1 i n e_v 
a 1 i dAA^ot^5„ JttfcgB 1 8 8 6f48o<DT 
KU^4J*»1 8 8 l*ft?ftlC#UC8O0!>h i tffl 

T^t^$n3fifCfc5^^*!J 1 8 7 2<Drt^i: 
SWSBfr, h i tift^©7'f^ro7'f^va 
1 i d$f§tf y M 8 7 3^til73$HS 0 £©3Ufifln? 

ftoT, Kw*©^fc;l-5ftT&try hftfttfrit 
•rt*Ufj*v\ £*u**^JtW»l 8 6 6j9»*>T 

fl» ( d c c _m o d e ) fcRJ&TS £ £ T-WUglCft 

■So 

[04 7 1] *^ry»>3.IW1lpaJl 8 7 yS/a* 
2 3 OiZhZf—P^OTf-tXtfTZmtlibZ, *■ 
^5^KB2 4 7, ^7VKC2 4 8A^©It (p 
r o c_r e q) tilfcj (pro c_a c k) Srgtfflx 
5. R^-KiaoTIi, *-C5'v'art!l2 3 0O 

*-c#3P#, * ^Jt&as 1 8 8 ea^-tw^y ©9^ 

Vfcfcy h£tij-f% a$ttfctyh«# (hi t 

0, . . . , h i t 7) (c:*j-LT, ^MryS'alWWH 

8 7 8 fitf — h (cache_read) (Cgg^^pJ 

^©K*a**prtik:i-«. ty ms- (h i t 

0, . . . , h i t 7) (p r o c_r e 

q) 1 8 7 e^fflStLfcWKtt, &J&£ftfcg# (e x 
t_req) ttttf-^ ©3r^ V S'a 7-f VCDfl-gM 

7-f^fiA7J (ext_data) a*orf6fcB# s -?r;ft,£ 
ICt^yv'a^t!) 2 3 0CD8OtfV<:/7{.:#£i*> 

1 8 8 efc^iiSfu *©7W , VO?^^«HK' 
y hi 8 7 3 35ffi7J$tv3 0 

[0 4 7 2] ^ft yi'a^* P 2 3 0(0 80©/^?^ 
7*-**- jti-JVl 8 9 2{C&3<^< 

h l 8 9 4f;i&fitttf f>tv5 0 *SSW£*- 
K"W-**- ^T-^if l 8 9 2 tt, giiW^e- 
(dcc_mode) fc£j5fc$ftfc*WM* V T KWH 
cD/<-T hT KU-X (b y t e_a d d r) SrffiV>5*{C 
.tot, 80W^y/<^7;^fcfti7j£ttfc8oro3 2 



m<D=e- K"W— 2f"*vflPl 8 9 2 
14, 80©^* y^V^^e.tti7J$ttfc8 0(D3 2 tfy 
h!7-Kfcit*Htfrt-5. f}iELfc:ji9, x — ^ — # 

[0 4 7 3] 5*f4*C0©fgT-tf fctv5 0 

1) ^Bty^/asy Hi^tr>i»Wl 8 7 

2) 8 0©7 HU*£jfc3. = y h 1 8 8 1 ttftfls*- K 

[0 4 7 4] 3) ^SftfcT KU^<0y-yMfc*»*3# 
— h(7?^^^^yi88 6c0 4 7*n;/^(rir— 7*£;tvC 

4) ^:©7'f>©7'fyv al idt 

11 8 7 3tfSffl£*tfcfc, g#$ftfc7*-*tt4f-vy"> 
^^y 2 3 0K#£1-5i*#£*t-5o 
[0 4 7 5] 5) #SL^f-?tt«^1890 
5r^-LT7 3c2/^-^tt, y>aptjejj 2 3 OCO80 

©7* o y ^ y frh<D*f—v y 4 xnnm 

y 1 8 8 6{c##ii*ti, ; &©7'fy©7'fyv a 1 i 

dttttl 8 7 3«Sffl$ix5. 

6) ^TOS^x-^^df^y^oL^^ey 2 3 0(C#^ 
-Tfttf, ^nti** ibtlfc^T- y hJB-S-Tfr/p-fcyS/l/ 

[0 4 7 6] BtfJSUfciiy , =>7°i3-feyy2 2 4<£>£T 
©a57> (0 2) te^fSpCBu s4 yf-7*-X3 0 3 

(02 0) Sr^fcTV^5 0 7*-^*+y^a«!l»lBJ2 4 
0i^t7'>i 2 3 0©^?pCB u s -1*^^-7 31-^ 
l^7.y-t75pMfflte, ftiB©B4 2^e>B4 6*-eiC|2 
*$tL-CV^5 0 ^<DUv ! ^^(DS!:^tt7-^MIf|52 4 
0©f£8)£r»P1-£<> ffi¥©fc«>, 2 0©U-^^^ (b 
ase_addresstbc c_mo d e) SiltSrH 
15 3 (C^-fo 

[0 4 7 7] 3 f—9**y*/a.<tim%2 4 0 tr-^* 
-V y ^ 2 3 0 flfcftWi tf , 7-^^ ^ y ValM 
ttftlO^'C©*^ y S^a 7-Y V^-te^lc LT^*^- K 

fflfflU 24 0i^ffy->a2 3 0 fi^Otifl^- 
KiC^y#*?§ 0 " I n v a 1 idate" KSrSfe 
^1t±X<D*<- Kfctt" Auto-fill and 
validate" fc f 7 ^"7'~> 3 l/-jfihZ>„ d c c_ 
c f g 2 U^^JCI fcfy hfc-fcy K-rS^ifdJ: y, 
^TO^r-Vi'v'a.Srb ase_address l^v'Ts^ 
tt-7**tvt^57 K^*»bfe»5l t^-ctS, 
CCDK^orfl, ^7VW-WfB, C24 7, 
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2 4 8d>bWr-*g;im*lh£;jT,5„ 4Hryis a tt» 

*£fifc«B 18 8 1 ^DTK^ £-tt}7J U f*3§p 

*;/>^y Zm^XZtltf**:}) *-ry->3. 2 3 OlC# 

2 3 OtC#£U£Vv!§-£\ AiM'y^-^x-^^y 
f-2 5 2a>bx-*#g#$;ft,5, ^fttf>oill$l$g 

[0 4 7 8] [BjBWig^KtfU lo©f-^7^f 

^o^5^>3 2 try McteH-faj: SlcfcS. irof- 

b. i/y^W^-jRfeSW***- K 
-©^e-K-CHt, g*tt^7^K^— tf-TM'ifg&Ba* 
& 1 2 ITy h'<4 b<DT KU^^T'ttS^jtS. 0 6 0 
fc*S*vC^<5«H;:» B#x-^7Vx.M48 If y h# 
7-W^ffiT?*)5o- 1 2 tfy HT Kuxi^ 
IC1881©inde x_p a c k e tA^lCA*^ 
tl, 8 007 K^^fifeSSl 8 8 lfiia 9 6tC;F&ft3 

(4*1 2 tB6 1 l:iotW6ti5. »^*y7K 
U-Xfi, 8o(D9 If y YUlst'H hTVuxt LT 
^IfiRAMCDSocD/^y-CD/W b£}g-f 

2 4 2tC«fc9^9VK^— ^-f-WWC, H6 01C^ 

Hlf-^tt+lryfa^*!/ 2 3 0{aDi*<5fc*. 
fir— ^vSr^-ir y^a^^-y 2 3 Otco — Kf 

KT?W\ 1 2 tf 3/ h!7— KT K^tf^V 
K*-^>f IfWB 2 4 7d>fe^ltb^5o 
T-YrA|4H6 2Sr#figUTWS6Ufc3 2 tfy 
a*«T?*5. 1 2 tfy ST Ku*ttT Kux£$gUl 
8 8 1©index_packe tA^tAA$it, 8 
O^T KI^&WR 1 8 8 1 n s 111 9 6 IC^ftS^ 
O8oo»45 3 2 tfy hfl-fflS^yr Ku*Srf£*. 

*«M*yr Kux<D7^yi?/7 Ku^Ht, ii2 

£126 3{C«fcoT^$jxS, *MM*y7 Kl^Ht, 
0 6 3«r#JSLTflMLfc«fc 7 try h 7^ VTK 

l/*<h 2 tfy h^^T>'U-X(^(tP>tL5 9 tTy hf K 
^5:ft5 8i(09t*y K7K^tLti|!f5$ft 



5. ^^7Kl/^W$Wofci^, ATM^* 
-7i-^^^yf25 2 (0 2) ^fciSgj/iT*-^ 
B-K*H5*T?*tJ'fa Jiff Jk-*" 5.f^ #f Uffl 

d . J P E G?f -JHfc^e— K 

^OT- KTtt, J PEGfffftt- KM&B&r-;/ 

—7/U<Dmmzi^X\tJ PEG«HHt J E-K (*1 

4, 16) ©tr.6^j$^<e>tT,TV5. 

e. {gjgj PEGgft- K 

f. frWJR**- K 

g . Disabled ■=&— K 

i©*— KTB:, £T<0g#f4A7H>'*--:7:c-;*x 
W 5 2tC/<X$n5o 

h. Invalidate (M$)4t) K 
tO^-KTB, 7>f^v a 1 i dft&tfy ])T 

5. 

[04 7 9] 3. 18. 7 

y^ 

tf-^ If SS 2 4 6 , T r -^=3f-+y^oL$J{HJ*|5 2 4 0, -ft 

-r„ ^itztitii>i-U-<y^-yx.-^»i$m2 3 8 £n 

* y 2 3 6 K&gfcT Y^X-h?-** 

[0 4 8 0] A^y?-7x-X^7f 25 2 J4-< 

-*t Ki/^^L<tt*x h^^y^yytcfcs^^y 

5. 2 OfiWT KUXtfy has&Rfcfci^ Cfttt:^- 

fWB*»b«>K*te»UT, A^^-7i-^^>f7 
^2 5 2{4, *i"f*-^©H!*6rK^O±ffi6K , y h 
H = /DtytO-<- KW-^lf y ^ 
©ift**S*-C*>«*N CO^©±{46 t*y h^O-Ctt 

[0 4 8 1] jg*<D±ffi6 t*y b&OXhZWrSrit, =f 
- ^ "7 y 7° *S 3 7" a -fe y f- © / * y |4 g £3i -r r t §rE 

c^yi»-<Difcil^jEU''*»S : d^JS | J1-5fc«)j!fe0 3 tf 

1) a/ntyf©^- 7.T Kl^^*»P>^"7-fey h 0 x 
0 1 0 0 0 0 0 0Htei5^^-7i-X#fi 
#6 1 6^^-^ bo 
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[0 4 8 2] 2) ayntyW^Dt^i^ h<D 
*7 VUXfrtt,*7±y h 0 x 0 2 0 0 0 0 0 0A> 
^Sn-^/M*!) W®U (LUC). j5Sfi*532 

A73^ > * - 7 a: * ^ y ^fc J; <9 sc y - 1 jifc £ n 
5„ PC a^n-fe -y -y-O/^y ? MS 

tcod^E I CJr^ae-fr^fc* i y-xm^-zm^^o 

[0 4 8 3] 7 K sE^tmXitmW 
& I B u s-O-^-^i-^fCfiji^ttSc E I CtL 
MCtt i -a c kftf-^ffi^nm, 
-^^y^-T-^&froSH-So b^LATJ-f 
7i-^^-f s^f4A7j£*bS!7- h'm&X'tyh Lft 

[0 4 8 4] A^^f — 7i- s^2 5 2f4f 

s r t *st# s*s, ftatt&Kfti* 2 o 
y=f-<DWtfeyv*f\z.hmmy?v H4, 

[0 4 8 5] lOOy-^^S^(CT^-fe^1-5CQi4^ 
£titcT-?<DT Ku*fc'<*-;* hftSrEttU g#7C 

**was*>5. fc*y-*ic*ht-*«aj*>«f«-e, ibu 

[0 4 8 6] 01 4 5{Clft-fr>f — 7^-774 
2 5 2 XYi/f2 5 2 itm^C Bus-f 

y^-7i-^i^7?7r^8 6 o^ict 

^!Sf-*8 6 3 £P!ffi8|S8 6 4©Bt200 I B u s h 
7^-/^6 6 1 ££fO„ 7 KU*«H|-»8 6 3 tee* 

w^tHHBs 6 3tt\ 7 KU'X^jEiSfto^frtKi-d 
fife, £*CJ:otr KM*»vy tr^m. W»*B 



8 6 4f4i*<Ag5fc£ I B u s h7^>-/^6 6 1 ti*b I 
Bus h7Vi/-/<6 6 2lce^5cDd^^:J?)5o ffi 

5t«ite7n ^7 ^ ^mx*hz>. 

[0487] I B u s h 7 8 6 1, 8 6 2S, 

<04 yf — 7i- XfrbAtff yf — 7i- .x;*^ yf- 

3. 18. 8 n— 77/U^y$J#pg|5 

B2fc*ttvt, D-*^?!)fflW2 3 614, n— # 

A-^y co$ij®i,$.0n — */M*y 1 3 7*ntyti*)© 
*i?^—/i>t<»m~1$ft%T?±7,m&<nte&<n£x i k 
n-*/vy*y »g|J2 3 6J4, ft**- # 
f--<f i 2 4 9*»fe©»#a*R*iA*-f — 
7.7,4 yf- 2 5 2^e>«Ogg^tfflU^*lCJC£;^-t-5 0 jg 
fc, J^a^>- ; ?-7 3 :-^$IJP|f|5 2 3 7 kW%<r>-W.c 
Bu 8A**>?»«)R*-IULi:»#a*B*K:»UTfc* 

S^fbfSfc&KF I FOA' 5 '7r^lLtV>5„ 

[0 4 8 8] ;«&Wfc*S^Ttt, 77-^h-l , y'77 
-^hT"7h (FIFO) ^y77«ll: l ^!)7t' 

* b?ji-$y?**:yffl$m&m^htix^z><, mi 

4 6 J4, &%W<0%$ 1 «HJgfif!|{c!¥V\ 4 h/<— 7. 

•5„ ^<D[iISSi-l4, «P7Kl9 10^O7^t^ 
*jKKi: , *-S2o©#ta*jK-h (A1 944tBl 

9 4 6) t 2<3<Dm^&l.^-h (C1 94 8tD19 

5 0) ^4il/-CV^5. S^ttJLtf- h 1 9 4 8, 1 9 
5 CXDy*— *^^i4giJffic7)F I FO 1 9 3 6, 1 9 3 8 
Mfi-C-y-^r 1 9 1 0d^P>ii}T<5cD{C*fL, 2 
ocD#£&#-jK— h^f,©7- *^tegiJf@©F I FO 

1920, i 9 2 2§r«y, ^afbgci 9 i 2S*-ey 

1 9 1 0fc[SjjJ»5 o f*««l«95 1 9 3 2 14, 
y^75s'*/ ; eyi9i 0^(04 y 9— 7 
W£±X(D*y Yu-Mt%-ZWM)-tZ>bMz±W<nX 

— Y7 9-t*i:iatt&. y ^U-S/i/rLTjr?^ ! 9 3 

4(4, ^*y 7W 1 9 1 0©fc*IC^75 y^^^E- 
y cd y 7U-r^W **«>£>Kl*glfcft#. MWMS l 

[0 4 8 9] $f4L<{4, ^t!)7Wl9 10W 

^7 t -^<DW.^-m^tm^mt. m 9 

44, 1 94 6HFIFO1 9 20, 1 9 2 2^, ^ 
{4FIF01 9 3 6, 1 9 3 8^^Sc^t±SL^- hi 9 
4 8, 1 9 5 0^OUm<D2®<Dl'- hT*ftt>hZ e Z. 
<D^*, #ta*t»*WUjK- M 9 4 4, 1 9 4 
6, 1 94 8, 1 9 5 OSraLTx— *«r(E2W-5©fc: 

XS^ty.TW 1 9 1 0-cote^(rg-t-5B#IW 
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[0 4 9 0] r-*!i, M 9 4 4, 1 9 

4 6 wr i%fr%m& lt> tyrwHio \z.wz 

ii*tt5o H 944, 1 9 4 6 (£$$£$ 

ftfcHI&te:, #J#Hiti? d©F I FO 1 9 2 0, 1 9 22 

<o*zm&-rzT$\ztez> 0 ##j2^#-m 9 4 4, 1 

9 4 6 SriiL-CCO-r — *&i£(4, F I FOl 9 2 0, 1 
9 2 2&--mc%?>A\ Xte^-* h/6S)iilTi-5*t?^ 
A-XtCjSA/T-V><„ -r-^/iS*U](CF I FO 1 9 2 
0, 1 9 2 2(C##&£;ft,£i, ^J^l 9 3 2(4DR 

T?~txi>mibtlZ>b, x-^ttftitju—l-T'F I FO 
1 9 2 0, 1 9 2 2frbm%-m£ft, ^!)7W19 

1 Odt^jZ^tbS, DRAM19 10^-7 H 
tii^lM F I FOl 9 2 0, 1 9 2 2IC^y 

-fey b^ftfcifc©^-*?- K#flfx.f>ixfci§£\ xrt 

*&$*l5o V^ftcDlS^fC&^Tfc, DRAM19 10 
^V^-* h^ffBI^Hfc^^^ii^., F I FO 1 9 
20, 1 9 2 2#£K/«£5;i\ Xtf £ <9 M^??**}) 

<D4^-y Mcfcl^-Cbx-*!*, FiFO^^-rs 

#rf5*T, #^^7K-h^P>F I FOl 9 20, 1 
9 2 2->i3Pgl$n/«C<i^(tT»ti? : *tl5„ '&%<n>Wi 
-g, ff Uv<-* H4, gJlilcO/^x h^F I FOl 9 

2 0, 1 9 2 2S:$|;UDRAM1 9 1 0lC#f&* 

F^F I FO 1 9 2 0, 1 9 2 2^P>gc^ffl$ttTDR 
AMI 9 1 OiC^tjiS^S^^-^feiS^ffW^ 
ft5o F I FOl 9 2 0, 1 9 2 2/i^cDy ? — ^^^5 
Mfc^-hX-hZfiiib. ##&3uK-M 9 4 4, 1 9 
4 5(0(4, fffllWlHSl 8 3 2/$S{fc£>3K- |> 

5o St K 1 9 4 4 , 1 946HFIF01 

9 2 0, 1 9 2 2^<Dx-^tej2HC»1-5V^^5SiJ'9 

I04 9 1] BJ3M±1UK-M 9 4 8, 19 5OJ4&0D 
l«-C»^-r 3„ M 9 4 8, 1 9 5 0 »9t 

1 9 1 0#Sj*ft, ttjft-f 5F IF01936, 193 
8^x-^^#ta*tt?>o ft$<Dr-*7- K#F I 
FOl 9 3 6, 1 9 3 SfcSt&SftS-^ft^, 
BU-H 948, 1 9 5 OtCJcSS^tbL^TOd 
&-5o ^£?K*l0^x-*?-K£#3ta4*J]$jI 

RAMCOS^WU4, <t r^©DRA 



Mg#aSfc5d\ g^ffiLF I FOl 9 3 6, 1 9 3 8 
dS-Wi fro ■gteg^ffl M 9 4 8 , 1 

9 5 o^n«±x-^*s*L/j: <^ofc b^T-r 

5„ -B.r©J;5^LT^ttlL^Ti-5t, FIF 
01 9 3 6, 1 9 3 8^y°})±y b ZtlX^^y 1 --!? V 

L^-h/J5^^/P§r^T1-'5t, F I FOl 9 3 6, 

1 9 3 8 (c:gSo-t^5l^&3T-* i>mmztiz>o 

[0 4 9 2] tCDRAMay h n-/W7)3ft/Hi£i:[§J 
5 J; 5 (-+3*:*, r/D-fey h $*vTV^$:<Dx-* 7 
-K^Ttei§$tl5*-C (i&(4, *fJfrt5F I FOl 
9 2 0, 1 9 2 2^45^, K^fcHtFIF0 19 
3 6, 1 9 3 8tf—mctj:5>1ZT*) b&my&b- 
£ JVfcV > <k 5 I- D R AMT ^ -fer X^^Sft^ttSiJPS $ fl 
So ^:TC0T^-lr^^— h 1 9 4 4, 1 9 46, 1 9 4 
8, 1 9 5 Ofi-^ix-m^m-f 3^— * hUfl^T KU- 

1 9 4 2lCy $*vtV>3. Md 

yr K^-?swi-s*^tgt?fcSo m^r^r^? 

JiDRAMt^^/W KL^^^fifcgBl 9 4 0 
K4.?):liR$ft, ftTY^xUW* 1 9 1 6<k?iJTK 
l^**?^ 1 9 1 8\z.m%iX%« TK^OttN^ 
y M4?iJ#?^ 1 9 1 8fdA^$tv, -*(0±fi:T K 
UXV-y M4fT#?>-* 19 1 6^AA$H5„ $>WC 
gPl 9 1 4(4, DRAMOtTT K^^'f AOF^ttff 
1 9 1 6d^^*yrK 1 9 1 O^frT KU- 
^SrW^U DRAMO^IJT K 1^* 9 4 A(??^(C{±i?iJ* 
1 9 1 Sfrbmr Kl^^SriiSo fTTK^*^ 
^19 1 6i^lJT K^^^V^l 9 18ft, 

h^M^B#(C*3V^Ti^^y 7WDRAM1 
9 10^P-K$M o rtt«, *K- WJt/l-omte 

*S*H-CfcSo ?IJT KL-^*»^^ 1 9 1 8(4, 

ft, *TT KW'^^^y^ 1 9 1 6(4?IJT K^*^y^ 

i 9 i s&atucmfrzb-fytytyhZtiZe ®% 
[0 4 9 3] ^mmmxit, t^yruj 1 9 1 0144 

x 8 fc'y h'U b 5 4 VSr-&A/-C*5 9 , !7- KSfc ►) 3 
2tf S ' h«r#^1-5i{gSU-CV>5 0 S(c, ^tt^tKD 
ttii^-H 9 4 4, 1 9 4 6(C*jT£;-r54/M h 
CO^tiX^^-^ff^-o-fes/ M950, 1952 
frh'Q, fiSTOCf-^iM* !J7K 1 9 1 O^^f 
tl^tWD 3 2 hx-^ V— F<DZtl?tl<D 8 f y h 
8MHc##&*ft5 «t 5 (c-fSo ^D7W1910 



-89- 



HiX'h6tclb, ttfc-rZF I FO 1 9 2 6, 1 9281: 

ZftxX*S<&m&hZ>o rtLfeCDF I FOl 9 2 6, 
1 9 28|j;ttii^FIFO1 9 20, 1 9 22©ny 
h a -MZ. m V ft 2. © t IR1 Dff -S§-C => V h d — /v $ ft 

Sri*. FIFOl 9 20, 1 9 2 2— fflf- ^©f^ii 
i £ ft £ 3 2 tf -y h <Dft to 9 4 If y h © JfcJS 

ffl^e>*i,s. mmc, &m<mi 9 3 0 f^#-m-ft:SB 1 9 

1 2 tW\ZZ.o\Zziy hP-/i/$ti«. *fc$ftfc#£ 
ii^^-^/Hi, Mffffil 9 3 2^A^$;ft SiJISigB 
<±;:ft<b©ff$ft£)Ev\ ^fiflsgpi 9 1 2{cJ: *> ^y 

[0494] 0146 (Dmf&tZftmU 1 9 3 2 <DU®T 

■ciwm-s. 0147 ii, 0146 \cjo\,^xmmn 1 9 

3 2©»f£©sfitt«r*i-#«0-e*,5 o *y—Tv-?<n 

%. t y ir y h <^TR#I£, WSif4?S««){C I D L E 1 
OO^ittO, i^l-Ct^tODRAMayhn 

--MM-aw yr^-f:/ (high) tc/^ 9 % #»u 

1351 9 1 4l4fTT Rl^SrD R AM7 W 19 1 O^-ji 
RASDEL 1 1 9 6 2ttf£^S&£;fr5. Wdj/ 

*fcfl§8&tt I D L E 1 9 0 O^fgKMSo "t 5 
DRAM tRP (RASyDft-v 5 ^^?^ 
SIRS) JfJSBtfWc^ftfcBffrRASONl 9 6 ettffi^ 
i5#£;ft ^(7?^ fTT hc-ZflfRASli 
a — L^/MC&S. t RCD (RASd^CAS^Oif 
JC*-f Sl^WR) ^ftfc^tlfcg, COL1 9 6 8t 
M^B&Stl* DRAM7K 19 1 0— A^-fSfcft 
©?(J7 K 3 «t ? IC#m-fbSB 1 9 1 4 tfS*^ 

-yf-ZtlZo {*;©^Dy^i r ^-?CASONl 9 70 
ftffifcW^ft, DRAM^lJT hn— 7 (CA 

S) flHMST^^o-tefcS. —J., tCAS (C 
AST9?4V9>(%lsyMm tmtcZfttzb. CA 
SOFF 1 9 7 2tt§^If$^ rO«-?DRAM 
^iJTku^^hn— y (CAS) lifftWy7^-( 7 

r. tic^ot^S J; 9iSiv>7*7-f *y f^rot 

ffttarsicttat-f-r*»&-, *hH-ltCP (C 
AST'y^^r-^^^^^^JRg) SH^fcJJvfc?, 

CASON1 9 7 O^H-^'JtU DRAMWTK^ 
*hn-7 (CAS) ttflW^TV ?n-|c*4 0 % 

L, £<Q ^774* 9 tectum !/7U 
y*>aiSSUitoTV^5»& % tRAS (RAST^f 
^T^S^ffilPE) it CP (CAS/Dft-^ 



SOFF1 9 74M^I$$ti5. ^©ttffiiT\ D R 
AMfiTK^^FD-^ (RAS) flHfrtt-T J/7 * f 
W K * 5. JtWny^iy STttMlMt I D L 
E 1 8 6 0««t«#U fcOlM'^A'BM&frlWrt- 

[0 4 9 5] RASDEL 2 1 9 6 4«lT!)7l/y 

>>3.s!*as*asn5i:, -iirp (RAs7y^+ 

-^-fS^IWH) aUWfcSftfcfc, RCA SON 
r9 8 0ttll:«|Sh5, ICtifDRAMjIITK 
l/^Xt-D-^Sr^T-f^n-l:*!), RASD7I/ 
yJ/at^^/VO|[tDRAM CAS*M^-f5 0 ft 
©^By^xy^-CSfliRRASON 1 9 7 8 — tf 
toft, DRAMfirT hn-/ (RAS) (4T^ 

T-fl/n-C^S. tCAS (CAS7^f-f!/^5 

^?mm &mit£tizbm®\*RCAsoFF 19 

7 6-^Ttoft, DRAM?!l7 ho— 7* (CA 

S) tt-fVT*** !?/N>fK:fcS. -fi tRAS (RA 

ASOFFl 9 74^'frJptl, DRAMfTT K^X* h 
n-7 (RAS) Htf VT^WTWldftt), 

D R AM-iM* ^ ;kd fc «> K ±15© £ 5 ftft 5 #V * $r|feR 
L, IDLE 1 9 6 0^S^^-r5 o 
[0 4 9 6] 1146ffl!J7l/y>a)!J7^1934 
(il£M(£#7>?.T'fc«?, 1 5^^P#afe9l:-|5| 
©@£u-K fl»DRAMtf«S*ICJ;t)g* 
ofcWhf^l/y^a gjfcff ^-£38£ $ y 7 

t/s'^g^SltT^ftS fc, C©g*i40l 4 7co*t 

J Isyi/tl/YYS.. ^ffi^RCASONl 9 8 Oftlg 

[0497] 0148 fctt. Ma- K7*-AT»H 1 
4 6<0#a«l 9 2 4<Oftfp^$tLT^5. 

-(191 o^7****lf RTf3a»«rft»a;frife£, 

KtCfflV^ttfci/^-Tp/HiS 1 4 9tC|ftBJ$ft 

[0 4 9 8] *H€*lOS**fT#tt, ^(0^<DZfy 
^tyT-f^$t4 1fyh^oW5 0 ifi©2ty 

■<iT^y-^<Dy\tL2 t'y hilft^#2 4 «k 9 JE»f $^ 
52 \fy h*7^^HlR«>fjtlT^5 0 fr^<D^#S:^ 
9 2 4tt*lc-?rft-? f tic7)g*#©4 
tfyh©«4rJt«U *Wffl©»*#»c7^-fe^*ffnr 
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h$tl5o V*!)7W 19 1 0~~<DT-7 

(•€rtim©S*#<o±^2 try h^a.^-^) ^p,*p 

-I3$©±&2tr y FT-^tDr^asffictH* 
5) , Jfctffc&^fcAiM**. (£T<D±ft2 tfy Y<D7 

[04 9 9] H 1 4 9 f±, ^^-e^COS^fC^-rST" 

tv^s,, hi 4 sicffl^ttirv^^v^/w 

©flfefetjaiSftT^s. iis^njSCT-e^awF i f 

O1 9 20, 1922, 1 9 3 8^ W*^1 9 3 6H 
4g32t*yK »$3 2»7-K-C*>5. £ COllii $ Kgft* 
t ?B*$H5lHlK^. y 7©P|©IV^f»ffi^^iT 

[0 5 0 0] ££fc*$*V-a*a4Jj*-h*fifcH: 

3. 18. 9 ftbrtv^— 

^2 3 9tt, 3^D-byf2 2 4©«^, y 

v?, CBu s<DftUkfm7*— At (Offline 

s f y^CD-r/Pf-7 P W^ v/V^&ifWc&CDy- n y 

[0501] 3. is. io ^gp-r^^-^^-^sij 
im 



y KT-ftffr 5 r i: ii^mxhh. a7n 
•feylHcBu h^o-fey^c7){5S^^y X-7VV& 
&mir 5 r i tfs-ct ${e»* * y *ftfe9;t * y -=? y tv 

y^ay^y^ytf^^/w^tt, 07-f^-f 

^— ^5r^-f5fc»(c:3^t 0 a-^^77-c y?^y° 
n-feyf-w— »S:»j*fS e n^o-fe y-y-|c{4, 

5 0 (EIC) 2 

3 8lt a/ntyfOPCI Bus^-^Bus-- 

§52 3 812, XtsJVf — 7i-7^^f y^2 5 2ri>fe 

^\ #-jvfif 2 4 9frb<DmXlZ.!fcCX*X h 

^ * y Kx-y" £S=# &tf H#{£ PCI BusiC?^ 
$t LTf^Si-rSo PCI B u s—<DT^ir7;{4 x 
"PCI Local Bus Specificat 
ion, draf t2. 1" PCI special 
interest group, 1 9 94©»to 

[0 5 0 4] >?-7 x-*m®n2 38I1, A 

^-f ^-7x-X^^ y^2 5 2 jtf-Jvfif 
249*^<DPC I%9 9l»0«:«^>PI^S^trfflftt- 

Jfco^'f T"};:^ — h=f7°D-feyf-cr)ifTj^T 
O^ts'^a^'fyR^liJL^ lfrt 2^0 

7*— ^«fliaS2 3 8 lei 9 
*S$tl5 0 jg^$)5-rv^-7x-^$fj|8l§)52 3 8 

[0 5 0 5] t\-U4>*-y*-xm®®2 3 8©flf^ 
{^fi, ±Tc0a7 p D-feyir(D^*^ 3 .-/wcDfc*ir{5 

a^^y hofea/^y ^cdt K^?7tr> 

^£«#-*5**"y^*s^yMS£*;h,3 0 i©^y 
f V y*tt % T * * ^ «: gjfc-T 5 * -yHc» 
SWTfcSc *«'fl'*-7:c-.xto«l»2 3 8#** 
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wpci Bu s<o^»)9ltiiST Pi'xaWtfET't 

t^PCI Bu s"«JR 9 §I^Sr^*-rSt> 5 -ooy 

'fyf 25 2 tf-^U* 2 4 9*>P>S*$tLfc 

y^-^MW&z 3 8 HtSlllttK:** ha. 
=• -y h SrffMKl L, £"tCD4H!f8T 5/ tri^^rE 

L<^>»)ii:i- 0 

[0 5 0 6] ^Dv^^yhaiyf (MMU) 
(El 1 5 00)9 1 5) 14, 1 %^<D)Vy^T^-^ Y'^v 
yy (TLB) asS^KftoTV"^. T L B fSSffi*^ 

l) tt:«!:fi«TK^as4itb*i*i:, TLBiim 
1-S&J8T KU-^/6»TLB$Xff# (7KWCvyf 

[0 5 0 7] 2) TLBlCfi, gE#^>' M» 

3) J»aMfc:IE»TKW*#-5*&ftfcl$» TLBOi 
f h y lev y^-f 5 fc-sr y Lfc^t ^ h y %m%}ikt 
5. 

4) £fc3Mb : t^TCTLBi^ MJ *#BJMW*«. 
[0 5 0 8] 5) «E*at : T LBxy h JJ tD&M^tfa 

mr K^ii> 4 try hT v^x^-xvgtz-m&ti 

3„ hW^UlJBl^kftS. 

6) TLB^Vb ^{Kg^ST 

14, 4 tTy hT KU^-5-^-C*#a*^5. 

TLBrtw^vhyjiEli 5 ll^tJ:547t-Ty 

mc*oti^3 0 ^?ti©w«iyhyit 2ot* 

y h<£>{&®T 2 0 fc*y hcD^ST 

6 7 1, *ft*»&*ww-s*a**—i &>#ftt&jrofB&'> 

f"-<X(44K/W HT-fcSo MMUrtCl/^^lt, ifc 
IfcfcJB^kftfcl Otfy M-C©7K^(I^?^ 
tt5©lCffl^5rt*Tft5. dtUC^oTTLB^ 
-5?li4M/^ h£T*f-/K~- h$ft5 0 
fiioro^-c^sfcae), t^t©T LB^vh yjanf- 

[0 5 0 9] TLBCIl "Least-Recent 
ly Used" (LRU) fi&TA'rfy XA#;flV>fc 

Sbft^hy^fca^fcit, wa*a:vhyic±#f 
1" SMdmtftxv h y £##&*;}i£„ 
[0510] El l 5 2 «T L B Jttt»f^roj|fth,Sr^-t-. 



^fMhivtc^mr KV*880f48 8 1~8 8 3CD3 
OW^tC^lt btlZo Tfil2fyh8 8 1 
-S?rt©*7*y htf>SB£-Cfc5fc#>, *fJE?-<&*«T 
Kl^lTy h8 8 5^^ hfej^ft.*,,, ftffllO 
ITy h8 8 2tt, v^^y MCJ;5S£**Vfciit», 

^ih^XlCfcoT^-fey h©gB#*\ -<-v ; #-^ 
<0SB^-*>«)V^-f^>T*)5. ^^1/^^8 8 71*3© 
-fefncDtttt, fc'y hri*^— h©SB#-efc3fc 

L BJt«lc:fflv>-c ftv^it*^ t v> 5 r. t £^ lti^ 
5o 1 0T K^tTy M41 0-r** fc*y hirni^/U 
t "ANDED" (Ift3ft) TLB/U«^7y7 
<£>7t&KT(£l 0 fc*y h<Z>{ESk<— 8 8 9£-£;i 
£■> {53®T Kl/^Oifi 1 0 fc*y h 8 8 3 fi, {giffl^— 
•^S-^-8 8 9C0±firi 0 fc*y h £ LT*V U-^ hfCfflV^ 

[0 5 11] ^<Qi 5te£j&$ftfc2 0 fc*y hflXKfflk? 
— ^#-^»TLBJiU*e>*b5. m^^ii/hyoiot 
-Srf-5 t, TLB tt^tSW^-^lf 8 7 2 i 

*y ^7^8 8 7 ^m^v^r^s^— -^s^^ 

tfam-<— ^#-f§-8 7 2<£>±&1 0 fc*y hii$5 
ST K^8 7 3©±fil Otfy h£ LT^ He 

^v>e>tv5. mmr k^8 7 20^(01 0 try hn, 

(^f 5v^^e y h#l ©*§£) *> 
{KfflTKU-^ (t^^ fc* y h/450CD#-g-) ^CO^ftt^ 
H8 7 5 }cSJ?$H?)o fear Ki^^WTte 1 2 f y 
H8 8 5ttjR3Br K^*>b^^ h(C^^.t)^5 0 
[0 5 12]*^JC, vj/fl^LRU/%77 8 7 
6*5Hff$tV, vyf^ti^K^Oftffl^t. T 
LB$*f4, A73-T>-^-7i— ^^-Ty5 L 2 5 2^ 
^T-'fif 2 4 9^TLB8 7 2fc#£L-fcVMK2B 

h^^y 2 0 3 »'<'-^-7"A^>6S*J ftfcfi 
H^ffea^iftSr^syf-L, ^rtiSrTLB(c#tJi^7i; 

[0 5 13] ^ur±*^ h*4y**V <D/n 

h y tt, 01 5 3(C^-f<t 5*7*— ry h«D2o<03 
2lfyH7-K-CM$tvt^5. 2#@W7-rai 
ST KU^.<Dfc«)0±fi:2 0 hfy HSrflWJU 1 2 
t*y h{4^•|^I$^^TV^^ 0 #jtSr5{R«T K^O±tt 
20lfy hft**UO!7-KH:^t,n-CV^. Tffi 1 2 

fyHctt^rsd (v) try h£#t&*«riB (w) *fc 
tt"9-K*>9" try ha^sft-c*}^ j^ocio 

try hfi^iW^ttTV^. 

[0 5 l 4 J ^-VT-7;^y V y S^ftKT 
LB^^hy tmcffimtf$-£iix^z e s±-?y=f~-y 

il^ - ^ W y 2 0 3 rtW^CD^- v?(Cfc>/c 
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t\*&LT^te\<\ MMUKfi, 7 7 hy*T\££*)m 
5„ rtlil, 3/ntyf2 2 4i)S6 4K/^ Ff^fX 

[0515] T L B 5 X&CDMMU&f£f4, £ 5 
01 5 4<D6 9 OlC^LT^-5, 

1 . TLB (C#£ L-^V^igffi^— S^S-f- 891 _h(D^y 

1 3 fy h<7M ^x-y^^*^i"^ 0 

2. <<— -y'r—zf/V^^y^^s 9 4, 8 9 6©J:fi 
4 tf y h 8 9 4 -i- v^-TW/lf-f y^895 

[0516] 3. 2 0fyh oy<— v$ 
8 9 5 t^<— S*r— 7Vl"f l/fy 8 9 6 9 t* 

5/ h U fTtt 3 try h M 0 0 0 ^tg^-fS £ t 

h 'J ©117 KM8 9 0 4:^1" -So 
4 . V>t— -?JV3.-y h y CO^JST K 8 9 8 frb 
*M>» M^!>a»P>8/W h4rR*UJ+. 
[0 5 17] 5. 8/W hC0^<— i?7— -fjV^V 9 

0 O^PC I /-<^-^iS$n-fci:#, VAL I Dfc'y 

1 KHr y h S;ftTV^ff£»<<-5*S*HtT L B $ 

T\ & KM® ft s?-r -TViol v h y tcS 5 *T^it 

[0 5 18] 6. t y ftZiSM^— i?&%-*G-fZ><< 

36±4at^HJilTLB.K»*a**ia. UrU^i'h 
yf4LRU^*y7r 8 7 eiCfcotJK'f V h SflfcTL 

Btefijcg^ns. *tia»e> t l b (Ditmimtrnxm 

£34**15^ LRU/<y7r 8 7 6 tt3E£f£ttS„ 
[0 5 19] E I C 2 3 8l:i:ftS£*VC^3^yf 3.7 
7^->3y8 9 2S4, 2 0fy hWfiiB^-^t'l 



(vpn) (c*fU &©#S£«rfl!^5. 
index = ( (vpn>>Sl) XOR (vpn>> 
S2) XOR (vpn»S3) ) &Oxlf ff ; 

SI, S2, S3tta£ttlCy*n^5A-5rtlfc 

[0 5 20] ^-^-7VW(0;S§M3t;$5 4 K/W h<7) 

©7 5'^ymsw. ^-^t-7Vw4, 
^T*^S«t5Jc#{r^*< iow«Sb (nu l 
1) ^Mfc-S^/CV^. 

[0 5 2 1] 77 h7^T^S*^ M^-'J l*3<D<<-i?£ 

— TVi^Vh 'J Ii37nti/t2 2 4<7>TLB(CdrY-y 
v'3.$ixT{4V>(t*V\ mt4, MMUrt^TLB*^/, 

[0 5 2 2] «t»fl;^^^yH4«3»^*S:3l#fir-r 
f y ht*(c«|^$H?.{g«^-v J #-^^U MMU 

B±-eWc$tt-5 0 TLBxyHJIc-ryftSi, ^ 
[0 5 2 3] **£<&*»flsfl^ttl,%a»fc**ifcfc0>T 

LBit^J:viiVN77'i'^-y7 1 ^^oTV^ 0 HSMb 
#H^T-fSt, MMUHt*Sbflstfy hSr^yTU 

^hyfcjLottibft&VvSI^ r^Sr-<-^7^;H> 
t^5„ MMUBx?-f|^mu 7^h^:L 

^Jc^fl-So MMU {47 -Y K/HKftlCA9. ^7-^5 

MMU $ tLfcSfc ♦) §1 # a» e>ff tW^*Srtt«> 

[0 5 2 4]»*(i|L*ftv-^SHfc 

tifcli*{ct>^-> J 7^/vh^a$tL5. Wy^-7 
*— ^ftMlplBJ (E I C) 2 3 8J4, -&s<*^7 
$^T^-5A7J-<^^-7i-7.^-f 2 5 2 iJg^ 
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5 2iHf**-jtf^1f 2 4 9fc(D=«5^ = 
i ciffupc ig^rofc&fl^^ic^ 

ftT^5 0 5(CE I C 2 3 8 H x -tBts^Q.mz.y-< ^ 

[0 5 2 5] 015 011 yrU4ls? — yx.<-*®m% 
2 3 8<Dffim%7]kLX\'^?> 0 I B u s mXtt^mtn 9 
105:1?), #*^SP9 1 0«g*©@Wtf5^fcttCL 
T (PC I *7t ^SftlWev^-zl^g 

BustCBust 
#oT^.5-j8:/«.xf&J$lg|5 9 1 l^&ibftSo RBus 

[ 0 5 2 6 J PCI /<X^mfrtlit I B u s m&lt I B 
u s K?-f>< (I BD) 9 1 2l£«fcoT$i:b;}x5 < , 
fc, PCI's©RBusg*ttRBusk>-/'! (RB 
R) 9 1 4te*o"t*Bl$;h,$. I BD9 1 2 tRBR 
9 1 4M5MT KW<S\ #)ST K ^jg-f^ * y -7 
^yha= y |> (MMU) 9 15frii.5 0 I BD, 

RBR, ^rtL^?>MMUIi, ^fteftPCl h^y-yV 
->3^^g^-e# -c, :ti5)apci^n-KW 

$B (PMC) 9 17l:J:ot±^§tl, a>-hn-/W$ 
tl5„ I BDtMMUfiPC I^WL«D^.S:l|jftU 
RBRttPC I#t»*©^«r5*i-5 0 
[0 5 2 7] Sijflsoopc i h^e-WSPSB (P 

TC) 9 18f4, ^yhtUa^ntyf^rK 
U^^ixfc^TroPC I hyVlf^a V^toS-f 5c 

^TWte^a— /^©T^-fc^SrnTiBfc-t- S„ PT 
Clt K^ftfdCBu sf-^jPMCgSt'PC 1^ 
^^Sfcft, P C I f-?/^ fy©3 y h D-;Hi 

[0 5 2 8] E I CW-^^t^ya- WtJ'sTK 
l^£ftfcCBu s h7Vf?->3y(ii?CBu s<< 

-^XU^^Clfy K^ig-To £*i6H#lpCBu s 

y^S/3>-<9fc*<D/N o yx^£/£«t?-x:^(4, PM 
CtPTC<03yhp-;VTt*fflt5/^r^4j?gt 

^xy? (pgc) /U9 2 ur^oxja-a^n 
(cpc ^!)?^fi^©g» > 

h©it*ICPTC©3y7-f^ 3 .l/->3 

[0529] 0155li, 1150OIBus 
9 1 2<Z>fl|i££^UT^-5 0 gttAftfc I B u s T 



9 3 0„ *7f- h 9 3 1 filM */K/D&* 9 
U a* hn— fruity? 9 3 2 tcmteW§-fr%£1- 
5 0 ^ffi^-^S-^-Sr^i-S^ 2/^9 3 0O±ft7 K 
V*t?y M4#!7^* 9 3 5tcn-K£;h,5. (Rfi'*- 
s?##tt, 9 3 6l; 7 yf-Ztitztom^-V^zm- 

MMU 9 15 (■ 1 5 0) ^jSfetLS. 

[0 5 30] mm^-V&^bT&rlRBT K I'* tfy h 
14, v^^ 9 3 7tioTS^^, PMC7 17 

(010 2) --©PC Ig*Ofc*W7K^9 3 8^ 

9 3 9 (da- K£tt£<, ^!)7xyf«ff|JH 
&57J'7>'?9 4 1 bT Y\s*7 y<?b%Mm&$ 4 3 
&fflv>5, PMCA^ig^tifcx-^te, x-^^yj; 

SB*^/i>Sr*i-v— *i#{CF 1 F0 9 4 
4(Cn— K$tl5 0 7-?#FI F09 4 4©ifflS5) 
-C^fflWfg(C^oT< 7yf945, 9 4 6gA 

*ttiLnS?y£ 94 6l4IBus7^y L'yS^i'Mf 

[0 5 3 1] ■ : f*3Vhn-/^a 5 /^9 3 2f4, fttt 
it©7 Yl'Xb7 ! -9mm<»\m.$m. * 

5o <SS^~^##* >7 V* 9 3 5 IBus7Kl/ 
^^bO^-^ffty h-Cs I Bu s h^Vif^v'a 

^BB^t*lCD— K$ix5o o try hfiyy? 
©-tftlO t?s> K4^}C§{TAH'5T Ku*a»e>< 5. 

tf y h 9 3 7^5 1 {Ciry h Sft-CVftHSttA 

t'y h^l (r-fes/ h$tl/So 2 0 fcTy hcDfitfiMMU-l' 

[0 5 3 2] mm<DW)ftx\ fcM^-i?mmt®MT k 

^**©«"?fflV^6>*l«V\ L/J>U IBD^-^ 

y*V*yh h 5 l oo«*asf?j3H 
b<DHM.^<DWMJyj7 ]) * v M4Hl^w<— f y^y ^ 
Ty7"-f-5fcJi>jC, -v^^ bfy h 9 3 7t>mXf%^<btl 

[0533] K^Ht, mm, MMUmWi 

Vvm^-V^Zte-ttcmc? 9 3 6. -7* 

^t5rEL<M-a'1-5fc«)^rfflV^P 5 ^^ / 5o W7 
KU^TJ^V^ 9 3 8(4#iaT K^?yf9 3 6A>b 
n-K$tlS„ i*ittPMCi»e>!7-K«SiS$ix5fctf 
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£iSx.<k5 bVT^Zfr&faZmm-fZ, -r^9\fyV 
it, *'}y9<Dt°<n¥y hiStttfcfcfflv^ftS^SrWBf 

n-/vn^^g 3 2 t^f -§-£fcti U 2o©f-^|ES 
iifiCPCIg^TL, i&StO&DTSffcftTK 
L^^fcl^-f 5„ #7^7te$rU^T K^7.^&& 

[05 34] ^h*^y^9 3 9lt 
3 VCOfe^T' I B u s^— 7> hmkMza—V'£thZ>6 

&2-oVXf\zt£Z>bs Y °— fruity? 9 3 
^fcfflU rtv?2-ocDr-*"l£i£fL PCI hy^lf 

10 5 3 5] 7'!;7i5'f7Kl/Xl/^?943B, 
#^n-K$^5 0 |<iBus Yy^ti/ss-fiM 

KU7.<Z>iiiJti!££*l,5 0 WfiWyffSi, 7^7 
syff-^lil Bu s 51510 *^§/c-f-<oteiffiv>^tv > 
Mc&iZ. 7* V 7 x. y 7 £ tltz V — K©&(D 7Kl/^tPC 

[0 5 3 6] 7°U 7iyf*l)^ 94 1IJ4 \Zy 

^ht^y^y^yh^, $E< IBus b 7 
^i'sy^^iyfT K^ir-es/^-fSt, 7*y 

*tt*^M«tt**5J:5i;:fc6. ftfc> ib us h 
9 7 5/ 3 V^7*JJ 7 * y ^ £ ftfcx- 7 0-§B£(t 
Sr^Bil-St. S**ixfc^-^ h©fi*!47*y 7 = 

[0 5 3 7] f-^FIF09 4 4|i 1 8 7— FX 3 3 
h©#RS!7*-;^;i'-F I FO-efeS, PMC 
*>?)©f-^l4, 7—9 7xy?-<D-%X*h%fr 
S^fc^-flfy YtMCF l FO\Z%$ £Sft5„ FI 
FOO5feSS^6c0x-^t4, ffiffl^rH{C/,c5^-^F I 
FO*»?>K*a$tH Bu s^JHtjttS. ^-7Se^W 
Lft^££$-fSa^y7ttc 1 ki^UWU 
I Bu sT7/ i"yV*i'YMJ]*&£tZ>. h7Vif 
7 3 ^#7° JJ 7 1 y 7 £ tifcr- * v vfflfc $ ft 
5^a-(r, 3yFn-^Bi?y^i>?,cflI^|i, FIF 



fctt*W L n S? 7 7 iC^-K. 5„ 
[0538] 0156 ft, @150(DRBus 
9 1 4©#ijg£7Fl.Ti^5o 3>-hD-/Hj:2o©W 
$9 5 0, 9 5 1 bOfUX-xyy y hSftSo #f 
#f&3§9 5 1 f4RB u s^y!?-7i-^5r3yh 
n— AA7Kl/^7 5 2liRBus^-7h© 
Ut&Xy y^£ft5„ * hO-ttb-?tl<07— T - ?— 
Kf4> h ^-7*/V t *l: F I F O 7 5 4 ICf # iZ; 
*ft5. F I F09 5 4jSS^1- 3 «fc 5 

9 5 lCiotr-Uf-f #® 9 fB&ftx * 
-#^tf#*ft£U:©!7- K«r##&*fcv*.fc 5 fc-f 

[0 5 3 9] ## &#n^y7 9 5 1 tt, fflUffl&fl&ft 
-!§■&?> LT^y^fiSi 9 5 OlCRBu s^-* h^HS 
*&«r»*nU #7M1f/J**:ft£Lt<7>!7-- K«r#*& 

t>*l*y Hi*D^9 5 7fcn— H£ft$. IBB'*— 

9 5 8#jS<*ft5 0 &S^-y##i:{KffiT Ki"*©T 

ffitfy htt^^^lr^otff^^tl, *7>-79 6 0 
Ko- K$ft, PMC"-©P C I ¥kifr.<Dtz#)<r)T Ki^x 
P C I R#©Weft©!7- Y'<Dtz#><DT 
-?bVM^-7*Ml, -f^TcOPMCM-l' 
7i- * =• V h n— /Uff 5 ^ ^ >-3 -y y n— /Ha 
•7?^9 5 0l:,totF IF09 5 4^67D?77i7 
h$n5„ ^>f7«Sg|i, ^-ltt^LT7 7r 

[0 5 4 0] #$&^ffij»9 5 ltt, r-7r^t/V 
£fflV>TRBu s/<-7x H<D*IT*^W-t5. f 5i:F 
I F09 5 4— <D7 t —9<DTi—Y**t>±U ^-f^tl 

'sftWrnttf-f f 1 Fo^t/^t-epc 1 

[0 5 4 1 ] @l 50{rSt>*M9. «!)^-^yh 
^•-7 h9 1 5li, I Bu s K7-f^ (I BD) 9 12 
tRBus Ui/—;<- ( I B R) 9 14 (DtzfriZiSM^— 

^fl-^btta^— ^#*^©«*srfia+*. 015 

5. 1 6^>YV<D^^/Uy9T-9-yf Y'/<y7r (TL 
B) 9 7 0lt TLB7K^P7y^9 7lHA* 

x-7^(7^oTW7J^ <9'M-fo &Mm&$*hx 

V^TLB 3^ h n— /Pn v?-y7 9 7 2 ii, RBR*7c 
ttl BD^tT L BT Y"lsXu*/y9l<Cs<y7T ZilT 

tTLBK±o-Cff*>H5fRR*«R+5. WiTL 

U-CfcSo TLBA7JT KW-^(07-^t LT(4, IB 
DiRBR-f^-7i-7> (Hs«^*ffl) , -<-v?7 
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ffl) £/ctiTLBT Kl/^n^y^rt©i/^^/j;^i 
£>3o TLBte, TLB3yhD-/i/nv?^^|c^f 

I BDtRBR^S9 3li- 0 TLBIi 
ftfcftifiT^-fe^^n^: (LRU) ttfiOfBft&fc* 
L, rtv(4TLBT K^^ni^y^{ctoTf4S§-ji^ 

[0 5 4 2] it&fcMtfik&iLtcWrS; TLB=>>-hn 
— ivn Vyf 9 7 2 [iX-^-^i/j^tXn ^ h n 
-/Ha s? y * 9 7 6 fc P C I SjftSrHte"*-* «fc 5 
Hi*-. ^-^f-^PT Kf*i-f* l — * 9 7 711 ft 

#^Sr-bttcpc I7K^tft«t8, pci»*i> 

9 7 4->7 5'f^3 0 R**H5fi»TK tiers' 

^Sf*iTLB7 KU-^OiJy^ 9 7 7— j&ibft, ^:<D 
^^-^7' / l /7 ^ tx - iy d— yuni^y ^ 976 

W S ?)TLB3V he— yUD-^;/^ 9 7 2f3 v TLB 

[0 5 4 3J SCI^(D^?<lfiSCI^f)<OU 
^^«*»W*©*rtiK:SiaiJB*H5 9 8 0. fl|#tt 

-f^-7x-X98 lfi, i?pCBu s^*-^ 
— P> T L B £ V"r— zf>vi$.4 is? yt * y gjfc^ 
CDT^-fe^^rx^— K-f5„ TLBT^t^li^tiiL 
#ffl"C, ^SrffSitfefclTLBrjy hn- /un{?y 
.*«rJBi^«. ^-^r-^jK-fv-^fi^fflL • ft* 

I0544] 3. is. 11 mmjyt—y^—xm 

(P I C) cD-M^PffltC^LTV^o P I C 2 3 7 
- Ktt, 

1 ) KxVWTJ^- K : ~<£>^e- Kt?, f*— *tt*»if 

-^Tt, J^fflH£j££;ft/£o PIC2 3 7li ) fctWy 5 
-*Kl*f U&SSt £;}x5*-f 5 >-^T'tti7J^ n -y ^ £ ^ 

[0 5 4 5] 2) t'x^A**- K : r©*- K"C, X 

3) -fey hn = ^^t- K : Ktt, IEEE 

1 2 8 4$i|Sia£i£$;h/C^5&ifS:/n h=>MC&\,\ 



P I C2 3 7^ iJgfcJEft, F^SPx-^y-^^B 

[0 5 4 6] 'Slt&tmfi'f—tKDy-XbLXIX LM 
C2 3 6 (ABus&fflV^) , R024 9 (RBus 

5. PIC 2 3 7ft, rftfcCDx— * V—7,frh<D YJ 
2op;i©y-^ii:7^7^ i/lcfcofcifrg-KiB: 

CBus, ABus, RBu scD^x-r^y-r^T-lllc: 

[0 5 4 7] ^r^^—MtP I C^rtgfU'^ 

**#£*;h/0>5fi|J|aCBu s ^y^ 7l ^ 9 9 
0cOaV|>n-/UTT'|&^-f 5o HfC, CBus^y? 
-7^-^9 9 214, 3 7"Dt 5 't2 2 4£?>LT^>a 

5o ABu s^y^-7i-^9 9 Hn-*/M?|) 

*991tCBu s^^-7x-^9 9 2tt^irt> 
/Wh-WKFI FO^^t^5(tj7J7-?/^ 

»9 9 4ityhD=?xMW9 9 7t^y^-7 
^— ^"f-So ^tlW^ a -;U9 9 4 1 9 9 7i4(±) 
^f*-^/^©ft<B5F I FOH-*Kl^ h^gg^ 
ffi-T. ir>-hn^^^$ijHtB9 9 7(4s Jia^/V^Sr 

3 >• k d ->v-f % t z je> \cmm± > v n ^ 9*- ^ ^ v 

A7JIMMIS9 9 8(CfJ, ffiv^nrv^Sv^^^^tW 
fc*T^A7J»a9 9 8f4A^3T-^-'^3.^ ; , h 9 9 

{Cl/<^ h-fo#|H$TF I FOfc»*ii**l55*-^ 
£^ h7^f KA7JF I FOif^tS. 
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[0 548] x-^-f-vg 9 6\azm*<D%V>'?& 
9 9 9F»J©F I FOVmiftm&^ftZltlbkzm^ 

ffi«i$^5„ ^£7*y >vi-z-&mihzmc s 

3/n-feyt24 KilSjPEG3-^2 4 1 £fflV> 
^XCi^^J:© J P E Gf^^-fy hV^-v? 

J: 0, 5 h£_h 

[0 5 49] ±ffi#®#HlSWi#3S§J!(D 1 ocoSllg^ 
[0 5 5 0] ft&A 

-HcJ; Vftiotii-t^-x^/v-y^yy^^y—ya y 

1. Hff^yvw 

2. iSHHry btx-f jy? 

3. u^^-fe-y h 

4. rnwik 

A. 1 -/&<Pf 

Al. 1 ^7u±yVcn±y YT y7{m-t%-im 

m 

•ay Yu-iv^t n-^/PDMA^M^t^t© 



Al. 2 ;W3>7-f^l/- >a y\m?f 

B#i^ 4<3y7^^-> 3 y^ti4i\ *v^-/P 
co3>-y^ yu— y 3 ymmtmcpo, dcc, oo 

B, OOC, MDP, JC, RO, P I C<73(M"C$>5 0 
Al. 3 -ttDfifcCDWv^^cDfSS 

zyyicxywifetstiz, 

fr<D^frteZ yy**2&fe£ v t>mcw>&£tiz>. 
2. fS5££;h,S^£l^**£#o^v^-/Mc, 

f g uyxfvmmicmfennz,, 

ai. 4 i^tiroii^U^^yKffl^-f^ 

Wr&iaX ttl&-rZ>*'<7yY*~-tf-f-J4Fi-±zft%R 
yy^^fv— -yay&fr-f, ^x/—r;^e—Yxm 
2. ^yyb'<D^ny±— ?y b& r^o^co^j 

{ci|#fb$^T^5*^{rtt N =7'nir?t|j:^f)t 
•<7yK07*-vy b%£.f&tZ>o *-<yyb'Bb* 
^yyYcic^^xteffimttxhz. t^vKAto 

y b'*-%i-4 -fit/<y y< Ktc t£ <o , t*- ^ ^ 
r^ = V hn-7a#t)tlfc7^— ?y h©^7^ 

Al. 5 MM-fr^tDJcm 

• ti^<Dmftmm*&i%<D%%X'W:fe£hz, 0 

■^HBHelvetica Bo 1 dfttH' 
tlXb^Z). 
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•^^7>f-/l/KBregister. field 

• I, ^mm^^fiX^^^V-Yby : -^V 

• A__d eskriptor, B__d eskr ipto 
r*5£tfC__d eskriptor ttSHHS-SMb $ JxT 

• " X : Y" ItXt YC0ii*§^i- o 

• M @X" tt^yutylrCCH/^^SfX^to 

Cbus (X) ■ BC/^^i/-> 3 yx^tf 



*cbus (x) n iic/^*-<u-*/9yxcJ; 

??" tt^W/jHBU fc5W**£offi&^o 
• w set" IJ^^^Iz-S/gyi/^joR 

A. 2 
£ : 

1. iSt^-KttOxCtOxD 

2. BftSttftlttTK^^Wh (i"ftb*>, S± 

3. T*aAl^-^**i^ri^5^KM:^^**Six 
5. ^SttA*^^/wftlcj:9SiSSnrv^o 



~ Pixel Organiser IS^-i *f) : 

if I.R=0 then 

po_len <- 0x0000: I. length 
endif 

if A_descriptor . S=0 then 

po_dmr <- set. 
endif 

po_said <- R 

po cfg,mode <- sequential — Set going 
po_cfg.dst <- mdp 

— DCC in "normal32 M mode (default) 

—Operand Organiser B {^"^ V S K*^- if B ) : 

if B_descriptor . If ! = other then --not blend 

oob_len <- po__len 

if B_descriptor . S = 0 then 
oob_dmr<- set 

endif 

00b said <- A — special for compositing 

i f B_descriptor. what= tile 

oob_tile <- 8 
endif 
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oob_c£g .operate <- operate 
cndif 

— Operand Organiser C (t^<y> V*^-jti~4lFC) : 

— There is no implied other data type for this 

bo used to specify any bit offset in a bit map_attenuation 
if C_descriptor . if != other then 
coc_len <- po_len 

— set up the ooc drar 

if C dcscritor.S = 0 then 

ooc_dmr <- set — including bo 

cndif 

ooc_said o C 

ooc_cfg. operate <- operate 

endif 

— Main Data yath: 

if B_descriptor . i t - other then — blend 
jndp__len <- po_len 
mdp_bm <- B 
mdp_bi <- A 
endi f 

mdp_cfq . instruction <- I. opcode 
mdp_cfg.blendgfen <- (5_descriptoi:. if '= other) 
mdp^efg. decode < 1 
mdp_c;£y. operate <- 1 

— Result Organiser V&9k7h-Jf-M -tf) : 
if R__dcDcritpox . S 0 then 

ro_dmr <- set 
endi f 

ro sa <- R 
a q <- sequential 
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— Pixel Organiser: 

Operand A only inakee; sense as source pixels, anything else 

— is probably wrong 
if I . R — 0 Lhwii 

po_lcn <- 0x0000: I. len 
endif 

if A_des crip lor. S - 0 then 
po_dmr <- set 

endif 

po said «f— A 

pojnuv <- c — I & F tables 

po_cfg-mode <- CSC 
po_cfg.dst <- radp 

— T>CC. sfitup: 

— should be other 

if 3_descriptor .if = other then 
dcc_ adrir <- R 

dcc_ciy2 . cache_iiiis£_inst. <- n_desnriptor . C 
if I.M= 0 than — sj.uq.1* output channel 

dcc_cfg2 .mode <- single channel CSC 
else 

dcc_cfg2 .mode o multi channel CSC 
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endi f 
end if 

— Operand Organiser B: 

Operand B should be other, anything else is probably 

— wrong but do it anyway 

if B_descriptor.if other then 
oot_ien <- po_len 
if B_descriptor . S= 0 then 

oob_dmr <- set 
endif 

oob_said <- B 
oob_cfg.opeiate <- operate 
endif 

Operand Organiser c: 
-- Operand C should be other, anything else is probably 

— wrong but do i t anyway 

if C_descriptoL_i£ 1- other then 
ooc_len <- po_len 
if C_descriptor . S~ 0 then 
oocdmr <- set 

endif 

ooc__said <- C 
ooc_cfg. operate <-■ operate 
endi f 

— Main Data Path: 

mdpefg. instruction <- I. opcode 
mdp_cfg. decode <- 1 
mdp_cfg. operate <- 1 



-101- 



4 JPEG^ 



if R_descriptor.S = 0 then 



— Result Organiser: 



ro_cfg,mode <- sequential 



endif 



ro sa < 



ro dmr <- set 



> R 



3. t^ayft^fttSo 

• 1, 3feSV>tt4^^^^ 0 
A. 4. 1 #* 

• r o_i d r : UAHIkftTcft U'SJ** 

• r o__c u t : ffl^3*y h \s*?7s9 

• r o_l rat : ffiXMMBRl'^** 



— if there is a register set targeted at anything other than 

— the JC, PO, RO or DCC 

if (D.R= 1) and ((^descriptor , topnibble 1=0x5,0x6,0x9 or OxB) then 

@C_descriptor <- C 
endif 

— Pixel Organiser: 

— Operand A is the source compressed data, and should be a 

— byte Stream 
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if X.R - 0 then 
pollen <-OxO000: I.len 
endif 

if A_de scrip tor . S = 0 then 

po_<±mr <- set 
endif 

po_said <- A 

— if there is a register set targeted at PO 

if (D.R = 1) and (C_descriptor . topnibble - 0x6) then 

@C_descriptor <- C, 
endi f 

po_cfg.mode <- sequential 
po_cfg.dst <- jc 

— DCC setup: 

if B_descriptor.if = oLher then 
dcc_addr <- B 

— if there is a register set targeted at DCC 

if (D.R = 1) and (C_des crip tor . topnibble = 0x5) then 
GC_descriptor <- C 
endif 

dcc_cfg2.C <-B_descriptor . C 
dcc_cfg2 .mode <JPEG decode 
end if 

— Operand Organiser B: 

— Operand B .should be other, anything else is probably 

— wrong, but do it anyway 

if b_des crip tor .if != other then 
oob_len <- pollen 
if B descriptor. S= 0 then 
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oob_dmr <- set 

endif 'i 
oob_salc <- B 
oob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

— Operand C may be a register to, set. If it isn't it should 

— be other r anything else is probably wrong, but do it anyway 
it {R.D=* 0) and {C_descriptor . if l« other) then 

ooc_len <- po_len 

if C_descriptor . S= C then 

ooc dmr <- set 
©ndif 

ooc_said <- C 
ooc_cfg. operate <- operate 
endif 

— JC setup: 

— if there is a register set targeted at JC 

if (D.R = 1) and {C_descrip::or . topnibble=0x9) then 

Gcdescriptor <- c 
endif 

jc_cfg . instruction <- I . opcode 
jc_cfg . decode <- 1 
jc_cfg . operate <- 1 

— Result Organiser: 

if R_descriptor , S = 0 then 

ro_dmr <- set 
ejidif 

— if there is a register write targeted at RO 
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ro_sa <- R 

if (D.R = 1) and (C_descriptor . topnibble = OxB) then 

@C_descriptor <- C 
endif 

ro_cfg.mode <- jpeg 

ro_cfg.chan <- I.M:I.4 

ro_^cf g- ups ample <- I.S 

ro_cfg.cut <- i,c 

ro cfg. limit <- I . T 
A - 4 - 2 'Po_idr:W*i*fccftl/^^ 
* : • j c_rml :flf^^-h-7-*^v^^/i/ 

i. aT©u^^f4#^setxKncRssixTv^f{j5 . r o_ c u t : a**^ vu*?** 

• r o__l mt : WAIIIIR 

— if there is a register set targeted at anything other than 

— the JC, PO f .RO or DCC 

if (D.R-l)and (C_descriptor . topnibble != 0x5,0x6,0x9 or OxB) then 

@C_des crip tor <- C 
endif 

— Pixel Organiser: 

— Operand A is the source compressed data, and should be a 

— byte stream 
if I.R - 0 then 

po_len <- OxOOOOl.len 
endif 

if A_descriptor . S = 0 then 
po dmr <- set 
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endi f 

po_said <- A 

— if there is a register set targeted at PO 

if (D.R = 1} and (C_descriptor . topnibble = 0x6) then 

(§c_de scrip tor <- c 
endif 

po_cfg.mode <- jpeg 
po_cfg.dst <- jc 

po_cfg.SS <- I.S — subsampling 

po_cfg.chan <- I,M:I4 
pO_cfg.F <- 1,F 

— DCC setup: 

if B_descriptor . if - other then 
dcc_addr <- B 

— if there is a register set targeted at DCC 

it (D.R =IJ and ( ^descriptor , topnibble = 0x5) then 
@C_descriptor <- C 
endif 

deeef g2 . cache_miss__inst<- B descriptor «C 
rinc_cfg2.mode <- JPEG encode 
end if 

— Operand Organiser B; 

— Operand B should be other , anything else is probably 

— wrong, but do it anyway 

if Bdescriptoi . if ! - other then 
oob_len <- po_ien 
if B_descriptor . 5 ■ 0 then 

ocb_dmr <- set 
endi f 
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oob_said <- B 

oob_c£g. operate <- operate 
endif 

— Operand Organiser C: 

— Operand C may be a register to set. If it isn't it should 

— be other, anything else is probably Wrong, but do it 
-- anyway 

if (R.D = 0) and ( C_de scrip tor . i f 1= other) then 
ooc_len < po__len 
if (^descriptor. 5= 0 then 

ooc dmr <- set 
endit 

ooc_said <- C 

ooc^cfg. operate <- operate 
endif 

— JPEG Coder: 

— if there is a register set targeted at JC 

if (D.R=1) and ( C_descrip tor . topnibble = 0x9) then 

@C_deacriptor <- c 
endif 

jc_cfg . instruction <- I, opcode 
jc_cfg. decode <- 1 
jc cfg. operate <-- 1 

— RcsuLt Organiser: 

if R_descriptor . S - C then 

ro_drar <- set 
endif 

r o_sa < - R 

— if there is a register write targeted at RO 
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if(D.R=l) and (C__descriptor . topnibble = OxB) then 



@C_des crip tor <- C 
endif 

ro_cfg.mode <- sequential 

ro_cfg.cut <- I.C 

ro_cfg. limit <- I.T 
A. 5 — ^n— f* j l/p 

a: 

i. ^x<D?-9=~>f4zsirifkttittEm. smfm 
^^-ftMDW&bffi tm\z$L-teti5 0 ztihwkiim&tt 



• p a c k b i t sflMHb (^-^3 ^A) 

• p a c k b i t satfe C^-^3 VB) 

• r o_c u t : y h 

• r o_l m t : tfj^fMIS 



— if there is a register set targeted at anything other than 

— the JC, PO, RO or DCC 

if(D.R = 1) and (Cjdescriptor . topnibble != 0x5 , 0x6, 0K9or0xB) then 

@C_descriptor <- C 
endi f 

— Pixel Organiser: 
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— Operand A is the source data 
if I.R - 0 then 

pollen <- 0x0000 : I . len 
endif 

if A_descriptor. S = 0 then 
po_dmr <- sesL 

endif 

po^said <- A 

— if there is a register set targeted at PO 

if (D.R = 1) and (C_descriptor . topnibble = 0x6) then 

@C_desci:iptor <- C 
endif 

po_cfg. mode <- sequential 
po_cfg.dst <- jc 

— DCC setup: 

if B_descriptor . if = other then 
dcc_addr <- B 

— if there is a register set targeted at DCC 

if (D.R = I) and ((^descriptor . topnibbte = 0x5) then 

@C_descriptar <- C 
endif 

dcn_cfg2 . cache_miss_inst <- ©^descriptor . c 
dcc^cf g2 . mode <- en/de coding 
else 

— if there is a register set targeted at DCC 

if (D.R - 1) and [ ^descriptor, topnibble = 0x5) then 

@C_descripLur <- C 
endif 

dcc_cfg2 . cache_miss_inst < - Bjdcscriptor . C 
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endif 

— Operand Organiser B: 

Operand B Should be other , anything else is probably 

— wrong, but do it anyway 

if B__descriptor .if != other then 
oob_len <- po_len 
if B_descriptor . S= 0 then 

oob_dmr <- set 
endif 

oob__said <- B 

oob__cfg . operate <- operate 
endif 

— Operand Organiser C: 

— Operand C may be a register to set. If it isn't it should 

— be other, anything else is probably wrong, but do it 

— anyway 

if (R.D = 0) and (C_de.scriptor.if != other) then 
ooc len <- po len 
if C_descriptor__S- 0 then 

coc_dmr <- set 
endif 

ooc_said C 
ooc_cfg. operate <- operate 
endif 

— JPEG Coder: 

— if there is a register set tarqeLed al JC 

if (D.R - 1) and (C_descriptor . topnibble = 0x9) then 

@C descriptor <- C 
endif 
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jc_c£g. instruction < ~ I. opcode 
jc_cfg. decode <- 1 
j c_cfg, operate <- 1 

— Result Organiser: 

if ^descriptor . S = 0 then 

ro_dmr <- set 
endif 

ro_sa <- R 

— if there is a register write targeted at RO 

if (D.R ~ 1} and { C_de scrip to r . topnibble = OxB) then 

@C_de scrip tor <- C 
endif 

ro_cfg.mode <- sequential 
ro_cfg.cut <- I . C 



A. 6 ^t«^5i^ 0 ~ cfg<limit <_ Z ' T 
1 . 3M^- Kl±0 x 4 (S*i£*) t 0 x 5 (3E 

m . 

Klft«»i:iii«*ai*<oit-(D3tv^B:, ^^ntyfl: 
Hi" 5 PR 9, H«**^tt*-*/^^y^YX3js* 



3. t^a> : 

• tr^-feyi/ O^KSrtf 5A»5d» 



-111- 



— Pixel Organiser: 

— Operand A is the kernel descriptor and the PO delivers 

— Kernel co-efficients to the MDJ?. This is ceded qg 

— "Other" 

L=l long format 
0 short format 
if I.R = 0 then 

pollen <- Ox00O0:I.len 
endit 

if A_descriptor. S = 0 then 

po^_dmr <- set 
endif 

po_said <- A 

po_cfg.modc <- convolution/transformation 
po_crg.dst <- map 

— DCC setup : 

— the implied data type here is the source image 
if f B_descriptor.if = oilier then 

dcc_cfg2. cache_miss_inst <- B_de scrip tor .C 
if 1.5 = 0 then 

dcc_c£u2 .m^de<- 64 bit mode 
else 

dec-cf g2 .mode < random ,inode 
endif 
else 

dcc_cf g2 . cacbc__mis3_ins t <- B_de script or . C 
endif 

— Operand Organiser B: 

Operand B is a pointer to source image (other), anything 



-112- 



else is probably wrong, but do it anyway 




if B_descriptor . if 1= other then 



obb_len <- po_len 

if B descriptors = 0 then 

oob dmr <- set 
endif 

oob said <- B 

oob_cf g . operate <- operate 
endif 

Operand Organiser C: 

— Operand c descriptor is borrowed by the r. c field 

— 50 OOC is not set up 
-- Main Data Path: 



— Kernel descriptor is delivered to MOP via PO 
mdp_len <- po_len 

mdp_bm.rows <- (^descriptor , r 
mdp_bm.cols <- (^descriptor , c 
mdp_cfg. instruction <- I. opcode 
mdp_cfg. long__krnl <- A_de script or. L 
mdp_cfg. decode <- 1 
mdp_cfg* operate <- 1 

— Result Organiser: 

— result will be pixels or part thereof 
if R_descriptor . S= 0 then 

rodmr <- set 
endif 

ro__sa <- R 

ro cfg.mode <- sequential 



r 



a. 7 mmw 

1. tf'O — mo x 3 

2. : 
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- if there ie a register set targeted at anything other than 

— the MDP, PO, RO or DCC 

if(D,K=l) and(C_descxiptoi . topnibble != 0x5, 0x6, OxA or OxH> tht^n 

@C_descriptor <- C 
end if 

— Pixel Oxganiser: 

— Operand A is the source pixels, only makes sense to have 

— whole pixels r anything cloe is probably wrong 
ir I.R - 0 then 

po_len <- 0xOO00:I.len 
end if 

if A descriptor. 3 = 0 then 

pc_dmr <- set 
endif 

po_said A 

— It there is a register Set targeted at PO 
i£(D.P.-l) and (Cdescri ptor . topnibble - 0x6) then 

9C_descript:or <- C 
endif 

po_c£q.iuode <- sequential 
po_cfg.dst <- mdp 

— DCC setup: 

— the i-uplied ot.h^r data type is a matrix of coefficients 
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if B_descriptor_i£ - other then 

dcc_addr < B 
endif 

— if there is a register write tarqeted at DCC 

if (D.R = 1) and (C_descriptor . topnibble = 0x5) then 

(?C_descriptor <- C 
endi f 

if B^des crip tor . if = other 

doc_crq2 • cache_miss_inst <- B descriptor »C 

dcc_cf g2 . mode <- matrix multiply 
else 

dcc^cfg2.cache_miss__inst <- B_descriptor . C 
endif 

— Operand Organiser B: 

— Operand B is the matrix co-et ticients (other), 

— anything else is probably wrong, but do i t anyway 
if B_descriptor .if other then 

oob_len <- po__len 

if R_descriptor . S = 0 then 

oob__dmr set. 
endif 

oob_said<- b 

oob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

— Operand C may e a register to Set. If it isnH it should 
— be other, anything else is probably wrong but do it anyway 
if (R.D = 0) and {C_descriptor .if != other) then 

ooc len <- po_len 
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if C descriptor~S= 0 then 

ooc dmr <- set 
endif 

ooc_said <- C 
ooc__cfg -operate <- operate 
endif 

— MDP setup: 

— if there is a register Set targeted at MDP 

if (D.R = 1) and (C_descriptor .ponibble = OxA) then 

@C_de scrip tor <- C 
endif 

mdp_cfg. instruction < - I. opcode 
mdp_cfg.de code <- 1 
mdp_cfg. operate <- 1 

— Result Organiser: 

if R__descriptor . S = 0 then 

ro__dmr <- Set 
endif 
ro_sa <- R 

— if there is a register write targeted at RO 

if (D.R = 1) and (C_descriptor . topnibble = OxB) then 
(^descriptor <- C 
endi f 

ro_c f g . mode <- sequential 



A. 8 /^7f-yM 

1. KiiO x 7 

2. ^>3^-7h^M;HIW 



-116- 



— Pixel Organiser: 

— A operand is the source pixels ox bytes 

— PO operates in conventional sequential mode 
if I.R = 0 then 

poJLen <- 0x0000:1. len 
endif 

if A_desoriptor. S - 0 then 

pojdmr <- set 
end it 

po_said <- A 

po_cfg .mode o sequential 
po o.^g.dst < mdp 

— DCC setup: 

— no implied other data type 

— if there is a register write L&Lqeted at DCC 

if -D.R = 1) and (C_dcscriptor . -opnibble = Cx5) then 

@C_de script or <- C 
endil 

if Bjiescriptor. C = 0 then 

dcc_cfg2. cachc_mis3_inst <- B^des cr ip t o r. C 
endi f 

— Operand Organiser E: 

— Operand B is the halftone screen, and should be a lile 
anything else is probably wrong. There is no sensible 

— implied "other" data type tor half toning, 
if B_descrip~or. if != other then 

obbl.en <- po len 

if B descriptor. S= 0 then 
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oob_dmr <- set 
endif 

oob_said <- B 

if Bjdescriptor. what= tile 

oob_tile <- C 
endif 

oob_cfg -operate <- operate 
endif 

— Operand Organise C: 

— Operand C word is borrowed by the tiied Operand B. 

— The C_descriptor is completely ignored 

— OOC is in random mode (default) and wont be touched 

— Main Data Path: 
mdp_bni. level <- I * levels 

mdp cf g. ins truction <- I. opcode 
mdp_cfg. decode <- 1 
mdp_cfg. operate <- 1 



— Result Organiser: 

if R_descriptor. S = 0 then 

ro_dmr <- set 
endif 

ro__sa <- R 

ro cfg.mode <- sequential 

a. 9 ^y-^tf— 

ft: 

1. ^3-Kli0x9 2. ^co^fl/^ey-^t 0 - 



• ^!i7x7/^DMA^P I C b LMCK<OW&tfy 

a. 9. i nm?-#tem 
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— Pixel Organiser: 

A Operand i3 the source data 

— PO operates in conventional sequential mode 
if I.R = 0 then 

po_len <- 0x0000=1. len 
endif 

if A_descriptor. S = 0 then 

po_dmr <- B 
end if 

po_said <- A 

po cfg.mode <- sequential 
po_cfg.dst <- jc 

— Operand Organiser B: 

— Operand B word represents a value to put in po_dmr 

— Operand Organiser C: 

— Operand C word represents the value to put in the ro_diru: 

— ■ JFEG Coder: 

jc_cfg. instruction <- I. opcode 
jc_cfg.ibo <- D.bo 
jc_cfg,obo <- I.obo 
jc cig. decode <- 1 
jc__cfg . operate <- 1 

— ResuLt Organiser: 

if R_descriptor. S = 0 then 
ro dmr <- C 



ro cfg-mode <- sequential 
A. 9. 2 ^y7x7/VDMAKi 



end 



ro sa <- R 



1. IBH«WfCt>-t5Tf*<fct>J|V\ Z<DZt\Z. I 
3. P I Cllf-^^H^^OT, ^<D^ 
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--only setup required is for the PIC: 



if I.S 



1 then 



— data in from PIC 



pic abus_addr <- R 
else 

pic_abus_addr <- A 
ondi f 

— if there is a register write targeted anywhere 
if D. 3 = 1 then 

3C_descriptor <- C 
endif 

if I . R = 0 then 

pic_abus_cfg . ab_count <- 0x00 : I . length 
endi f 

pi c_abus_cfg.ab_byte_en <- I. byte 
pic_abus_cfg.ab_7.ype <- T.R 
pic_abus_cf g . start <- 1 

— Pixel Organiser: 

— Operand A is r.he source address given to the PIC 

— The PO is not setup 

— Operand Organiser B: 

— OOB is not setup 
— Operand Organiser C 

— OOC is not setup 

— Result Organiser: 

— The Result word represents the target address 

— RO is not set up 



A. 1 0 7* hCD#S 



A. 10. 1 &¥ffim 
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— if there is a register set targeted at anything other than 

— the MDP, PO or RO 

if { D. R = 1> and (C_descriptor . topnibble != 0x6, OxA or OxB) then 

@C_de scrip tor <- C 
endif 

— Pixel Organiser: 
if I.R = 0 then 

po_len <- 0x0000: I. len 
endif 

if A_descriptor . S = 0 then 

po_drnr <- set 
endif 

po said <- A 
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— if there is a register set targeted -at PO 

if (D.R - 1) and {C_descriptor . topnibble = 0x6) then 

@C_descriptor <- C 
endif 

po_cfg.node <- sequential. 
po_c f g . dsL < - rndp 

— Operand Organiser B: 

— Operand a word is completely ignored 

— The B_descriptor is completely ignored 

— Operand Organiser C: 

— Operand C may be a register to Set up 

— Mara Data Path; 

— if there is a register set targeted at MDP 

if { D. R = 1) and (C_descriptor. topnibble - OkA) then 

GC_descriptor <- C 
encif 

mdp_cfg. instruction <- r -opcode 

if A descriptor. if /= packed bytes Chen — tell map about size 

mdp_cf g. word_input <™ 1 
else 

mdp_cfg. word_input <- 0 
encir 

mdp cfg. decode <- 1 

mdp_cfg« operate c- 1 
-- Rp.su It Organiser: 
if Rdescriptor .S = 0 then 

ro_drar o set 
endif 

rcja <- R 
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— if there is a register cet targeted at anything other than 

— the MDP, P0 r 00B or RO 

if (D.R=l)and(C-descriptor. topnibble !- Uxf., 0x7 , OxA or OxBl then 

@C_des crip tor <- C 
end if 

— Pi*el Organiser: 
it I . R = 0 then 

po_len <- 0x0000; I. ien 
endil 

if A^descriptoL . S - 0 then 
po_dmr <- set 

Ftndi t" 

po said <- A 

— if there is a register set targeted at PO 

it (H.R = 1) and <C_descriptor.topnibble = 0x6) then 

@C_descriptor <~ C 
endif 

po_<z£q .mode <- sequential 
po_cfg-Hst <- mdp 

— DCC setup: 

— no implied other data lype 

i.£ there is a register write targeted at DCC 
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if <D.R « 1) and (C_descriptor , topnibble = 0x5) then 

@C_descript:or <.- C 
endif 

if B_descriptor.C = C then 

dcc_cfg2-cache_miss_inst <- B__descrip-or .C 

endif 

— Operand Organiser B: 

— Operand B is the residuals or one operand for 

— interpolation, anything else is probably wrong, but 

— do it anyway 

— There is no sensible implied "other" data type for 

— this. 

if B_de scrip tor. if != other then 
oob_len <- pollen 
if B_descriptor . S = 0 then 

oob_dmr <- set 
endif 

oob_said <- B 
endif 

— if there is a register set targeted at 00B 

1t (C.R = 1) and {C_descriptor- topnibble - 0x6) then 

QC_descriptcr <- C 
endif 

f^ob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

— may be a register to set 

— Main Data Path: 

— if there is a register set targeted at MDF 
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if (D.R = i) and (C_descriptor . topnibble = OxA) then 

@C_descriptor <- C 
endif 

mdp_cfg. instruction <- I, opcode 

if A descriptor. if /= packed_bytes then— tell mdp about size 

mdp_cfg. word_input <- 1 
else 

mdp cfg . word_input <- 0 
endif 

mdp_cfg. decode <- 1 

mdp_cfg. operate <- 1 

— Result Organiser: 

if R_descriptor.S = 0 then 

ro_dmr <- set 
endif 

ro_sa <- R 

ro_cfg.mode <- 0 — sequential 



A. 1 1 •MMM* 
ft: 

i. mm^\*2wm<n&fc. ir^hyn-mm^ 
a. 1 1. i 7*-m® 

4. *^?^KCttRJ£1-5^V^^"et>Av\ 



— only thing to do is: 



endif 



if D.R - 1 then 



— Pixel Organiser: 



— Operand Organiser B: 



— no setup associated with operand A 



— no setup associated with Operand B 



— Operand Organiser C: 



@C descriptor <- C 



— no setup associated with Operand C 



— Result Organiser: 



— no setup associated with the Result 
A. 1 1. 2 rt»7*** PJ-K) 
ft: 

1. jj-^a — KttO x A 

2. y - k^»*x- vmzmmt So 
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3. RoimmzxyKtvVftx+i-^TZftim-o 

— Pixel Organiser: 

— no setup associated with Operand A 

— Operand Organiser B: 

— no setup associated with Operand B 

— Operand Organiser C: 

— no setup associated with Operand C 

— Result Organiser 
ro_subst <- Cbus (A) 
ro_sa <- B 

ro_cfg.mode <- Cbus_data 

Cbus (A) <- B — Pixel Organiser: 

— no setup associated with Operand A 

— Operand Organiser B: 

— no setup associated with Operand B 

— Operand Organiser C: 

— no setup associated with Operand C 

— Result Organiser: 

A. 1 2 -^ttieW associated with the 
tt : 

1. ^3-F0x0, OxFIJfi^^tltV^. 

3. Ziib<D^ffi£ft1t^\*=7°v±yV-tf$-'&f!ffi 

l. l i^***5.fcU t 7 1 -:7> 

fib <o * I* 3 m t) (D^mx-WR «rtB-c& 5o 



A. 11. 3 rt$T*-1r* (74 V) 
ft: 

1. ^=r-KI40xA 

3. ^O^te" mW &$X>\-ift^<DV, I C£tfK> 



K W5kh-Z>£?iC5%fe-rz>ztxn$L£hX^Z a 

Ktt r^Hij -CfcS. 
1. 1. 1 w^;**G>#Jfi 

« fl-lfB : sWU (frt3<DT?±X) . CBus 
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-fey h"T***V\ 

^^cccbu sifctfWK- nt-f (1"ft:b*>, t' 
y hiry h*e— K^f-JK— hbft>"0 , TK^froJ: 

^7C5)CBusgf^ Cfftfr^tfy Msy h*e-K) 
hU h¥fitSft5M©fe5 3y7 

3VhD-;H l/i>7^ 

pTffi. ay ho-/H l/'^^tt^T'COCBu sjft 



J: 9 *# ft a ^ h n-A^MitflHW* * * t 
ayl>n-/U2 U-v 1 ** 

bTIB. 3Vhn-^2y^^lW'f7*COCBusi 
^ (i-ftt>t)lfy b-fey h*-K) try 

^SB^€>l4lS-#tatfr ifcio-COKiy-fey. h 

3 o<7?7 ^ — >v K^e>#fi!c$ix-5o 
[7 : 0] t^a-/H;J:ott^*tlfefc?>*5x5 

[2 3:8] *i?z.—MZ&oX±l&£tl-tt&>b$>50i 

[3 1:24] /Hcio"C^*$*lfcfcfe^5 
«9&*tt«*rft«*?-51. 1. 2 v*S*9-*yf 

U*"T?ttfc< US>**#*"T?*>*. 
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^1.7 mm diag U-S^* 7 *f —A' K 



•7 a —n*- K 






1 : 0 


diagcf g [ 1 
:0] 


§ BSMfr/n ^7A^ diag_i t'VKftjfc, 
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J - Z 


diag dcf g [ 


H LLa^kyl y ^ y S (J A u J dU o l-s^ r ^ cJ 




1 : 0] 


UU- {3 C/tJ^pvT / S / A ' 






OXciy O L U J — rLl ' -r\ * >^ y l x v / / ✓ 3 






aiag_cl L 1 J r7^Ty s ^ *s 






diag Q L ^ J — M 7* J* 7* ~rr y | / *s y S *s <s 






diag d [3] .X y — -7 ^ — y\ h :7 ^/ y -y 












aiag a[4]=i^Efiw "p / y 'Y y 






aiag d [ r>J — ^f'fiiiT? / v ; *4 s 






01 — Tin tj B rZs^0m 






aiag_a[3 - oj — v yj — 1 r 






aiag o L *j j - *7LflfJ i7 ^ ry 






aiag a t->J — ^s!3i/tJ 












aiag o L u j — / ;/ -r t .y ;x l y i | 






aiag all] 3 / 7 <t / y- 1 •> y >*» 






aiag ai<ij — ljud c y r 






aiag a i -> \ — i juts ^ s\vj y l~ ^ 






aiag a l 4 J nuv c >y* p 






aiag a L b J — MUV ^ y^ c/y y t_ y^ 






11- e Cja^^fy^ r 






aiag dLuj fo/p^^w/JIt^ 






aiag dl 1 J -oob at o^ypqyji^ o 






aiag__0 [ c \ — UUv^ /V J D vy (jjyj^j^j- *3 






j> -i r -3 1 _i>r\ - 7 -p* y /b»y)\ '"7 }\s(T\ ~fc~ tfi CP) "X k ' — 

aiag a [ j ] -ku / / ^ /^jj^ y /vv.j s\ \ 






lis 






aiaq_a L 4 J -wd y^ ^ y y vt *o 






diag d[5]=7lcM ! 
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afel.8 mmgrst Vi^jrT* K 





£1* 


mm 


15 : 0 


msrst [15: 0] 








M^frey hi' l kick 19 y7 


















bit o^y^7i^nyhp^ 






7 






bit 1= d— ^/l'^^ U =i> h a — 7 






bit 2=Iffl^y^-7i^3yhti- 






y 






bit 3 = 1^^* v 5 a— 






bit 4=^=>>-ha-7 



£1.9 rom_gerr Ui?* 974 — ;u K 



7 4 K 
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err f 15-01 


^i?zt.*~ J\/Zlh<D7»*T^- $ * if v h 
































1 = £ <D ^ ~- y — 






bit 








bit 








bit 








bit 








bit 








bit 








bit 








bit 


7 = 3 ^7 h • ^ —tjjA irB 






bit 








bit 


9= J PEGf^-jt 






bit 








bit 








bit 


12 = reserved 






bit 


13=A?M — 






bit 


14 = reserved 






bit 


15= ¥Z.fr<D*ti>^ — /l^acy — 
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15 : 10 








J 




'J 


















bit 0=^W^^-7i-^3>hn-7 






bit 1=p— ^//u^^-i) ho — 9 






bit 2 = ^i2-f y^-7x-^^> h P — -7 






bit 3 v 5 a —/l' 






bit 4=-fft^> h n — 7 






bit 5=7*— #3r-Y 7 % >=i =»> h«3~7 






bit 6=fi3R^^7"^if 






bit 7=^7 V K • Jt—jflrJ-fB 






bit 8 = 7j-^7^ K • tf-fyf-f C 






bit 9= J PEG#S-{fc 






bit 10= ^ 4 >"f 






bit 11 = $*;*— 






bit 12 = reserved 






bit 13=A7M>'£' — 7^— 






bit 14 = reserved 






bit 15=^^^^^^—/^^^ 




nun gexp V i'T. ^ 7 ^ — K 


7 4 — }V K 
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31 - 16 


exp[15: 0] 


1/ /^i^i/J\9 { l7r ^ J v XI s y r 












}] -p"-£- 7: -A- *^ t"»" fT^I JX- _ , »r W ^, L 






^7 ll J l«+ 4^ ^> C \ 




* 








1 — K_ vj"tr y zl — /U'/J* Olj'JyhiXfiB^B^ 






bit 0— 7hpM — yx^-^^y l^n — -7 






bit 1— p — J) J\/ 7^ |j ^ y h u — y 






bit 2 — JrJj^M v/ y ^ s^-is/ h P T7 






bit 3 — tOWJ^ 






bit 4 = wir r3 ^ r P — y 






bit 5— X — y =r T V y ^ ^/ht 3 7 






bit 6— {SJ^X — Z'T't y 






bit / — xT^y y r • A 7j Jn 7 0 






bit 0 — /3 ^/z h • ^ 7v 7 -i y L> 






bit 9= J PtG^^rlt 






bit 10 — ^-i ^7 — y s^-y 


• 




bit 11— jTm^^O' — JJ ) A IT 






bit 12 — reserved 






bit 1 J — A/J a y y / ^"-/A/A^ 77 






bit 14 = reserved 






bit 15=^r^OT^^~;^M^ 
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1 C • fj 

It) • U 


exp en [15 














n X j^i l*" ^ L \ 


















bit 0— rHJIW /7^7x' — ^ ^ n — 77 






bit I — & JjJVt* tt v ^ P P 37 






bit z — JrI jZZ^i y v y ^ /n- 1 / r — 7 






oil j — toitv-t y * u 






bit 4 — fp^ 1 - 3 y r P 7 






bit 5—7 7 Tt y y a- s r p — 7 






bit b — m^<4 7J/-n V 






K -j +- -7 — -i-/v£ ^7 * ✓ fc* • -tf ~f- V" -H* R 

DlL / — ^1 '^-s+s p A JJ ) A yo 






DlL O — ^^7^P ^ 7/ y "1 ™ O 






bit 9— J JrJbG#7ric 






i31l 1U — -n y 7 /^>s 






bit 11— ^ptK^ 77 7 -1 y 






bit 12 ^reserved 






Dlt 1J — y\yj-n y v s s\ s\ a 77 






bit 14=reserved 






bit 15 = ifi^eZ)*^a.— /W-CW^ 


iron gint 


7 A —fr F 




KM 
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31:16 


int{15:0] 
















1 = ^(7>^ v^— /l^fe OM&^fi 






bit 








bit 








bit 


2 = /5]i2M — b d— 7 






bit 


3 3.— /L' 






bit 








bit 


5=7*— ?3r-V yi/=t.^y h p — 7 






bit 








bit 


7=^7 > K - ^-^f-r-^ifB 






bit 


8 = t^7>K • *— tf-f^lF C 






bit 


9= J PEG^-^ft; 






bit 


lo^-r^x— 






bit 


11=^*^— 7/-7-^lf 






bit 


12=reserved 






bit 


13=A7J-f V* — 7*-**>r 






bit 


14=reserved 






bit 


15= ^flM^^-z^f)]^ 
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1 s : o 


j.n u en i j. o 


-\? i — /l/^^^lIiA^JL^ ifc' — ~f J\y 

U — L / *S fXJ- pPJ Xc± \ S ' " 




• n i 
. y J 


C\— /T) JE« Cy , — ^^gJUA^^^E 7^yfc X I>fT) 
U— v — <yy " -X ^ /VVJ"* O vJxri K±*r HA /v '4* ^ r ^ 






prJ y ilj er /* 'c*» Kj y X — v — ~rr / ** /j *o v y ppj 






^T* *1A flS /v *4 * rv^o'J V ili<7 , fi- , «Js'y 






h-i t- n=44w£rc^ V>£ < ~7 -r >Cr^ x ^ L n . ^ 

Die w — 7rpp^ y ^ s\ - 1 >s r LJ y 






kn f- i — t-t % — -fi j\y -J cp- JJ n V k n ' -=7 

Die ± — i- 1 /V t y > — ■ v*^ p M y 






-D-L L z — /Dj a/J ^| ^ y y y\ — t y p M y 






hi +- ^ = {ih (T)^ ^ -l — /U 

JJ1L vJ | U± vy i_ ,y / ^ 






"h <t 4- / — 4^^" n V k n c "=5 






Dl C o — y x^-yy^^-^-^P 1 - 1 y 






iJ_L L o — i±±u^< yg yv y n y 






hit- T = ±/< c 7 V k* i ^1- — ^ti^r y( +f R 






x^xoij si s \ si yv y | y v> 






DlL j — J re ^ttTtju 






hit- in— — % y^x 






K . . -IT— fcfcm^h. ./f J- y jf* 

DlL 11 — JTptN:^ A/ V "1 y 






t" 1 9 — r^<?pn;prl 






hit 1^=7Wl — *7^c- — XX/f ^vf 






bit 14— reserved 






bit 15= <t ^_» : »CD' : ti/ci.— /u-^fiJ^.^- 


m 1.12 


mm active 


U'v'^^7 A' K 


7^— /UK ] 
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15: 0 


inactive [ 1 






5: 0] 








bit 0 — TrfnW y ^- — ^ r p 7 






bit 1= ci — * !J ^ h n — 5 






bit 2 = j§323-i =3 ^ |- d — *7 






bit 3— liE^^V^. — 






bit 4 — v\iyj ^ S r & y 






bit 5 = 7 ^ y ^ ^ h o 7 






bit 6=Wi^ t -//T4 It 






bit 1 — ^4^7 y V * -4 — 77 v A v o 






bit 8 — V S r * A 777 -1 v O 






bit 9— J rliLrf^^lL, 






bit lu — s A yj y /^s> 






bit 11— ^^^T^T/v -f IT 






bit 12 = reserved 






bit 13-A7J1 w/? v -7x w -a^^| !/T 






bit 14 — reserved 


Li ■ — 1 LHJ # 1 




bit 15 = ^^.^(7^^^— Jls&Ttrr-jy 



^3 

I . i c c f g 

i c_c f gWS?**tt3o©to#fcai;h,$. *Tf4^ 



CO 'J -fey HStiO x 0 o o ooooo-?&5„ 



&1.13 ic cfg U-^^^7^-AK 



7 4 K 






1 


E 
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2 


B 


1=;^ Ax Lftt/ -1 


3 


S 




[5:4] 


pri[l:0] 


0=A (i^V^) , B (fficVO 
1=B (fifV>) , A (fiV*) 
2=j=l#§? (round robin) 









6 


asr 




7 


pd 




8 


cbus eic_dis 


cbus 1M ? As<dW\ etc ^"cxd^^^V * 
l-ir i" Xx-^ 


9 


cbus mm dis 


cbus 1M ^yK7)R{]MM}6^iZ>^^ rw 


10 


cbu3_int_dis 




16 


a__en 


1=a ^ M) — A-f-ts— 
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17 


a seq 


o=a xhV—A$"-tryx&$7 : s{Z.zz— 
1= A * h U — A->— y 


18 


a_prefetch 


o=a * hU *-A3fc& 5 xoi— 7> 

1= A * h -J — A$fcfl*»J yf^—^JU j 


19 


a_overlap_en 


0=a ^ h !)-Al§r-{^x-//i/ 
1= a * h y -Afig^^-y/U 


20 


a snc pause 


o=V— jr^#*3fcj«9ii*^ A ^ h 
y — Afl5f£ikL£V> 

y -A^ffik 


21 


a_ic_pause 


o=#^SfeT9l U ii^T\ a* h'J—J»&W- 
it Lfcv^ 

1= ^^3fe7fl 9 a * h y - AtfSf? 
it 


22 


a auL piiiae 


0=A* h y -ACDv— ^^'^S^TS'J 1 ? 


24 


b_en 


0=B Xhy-Af-f^x-^ 
1=B* h "j-A-f 


25 


b_seq 


0-B* h ^Ai/-^y^f^i^ 

i= bx h y -a->— tryxmn-J ^— y 


26 


b prefetch 


0= B * M - Aft-Hx. 0 *? 4 X^—f/V 
1- BX M^Aftl&D'f #-:7> 
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27 


b overlap en 










1= B* 





7 4 ->v K 




mm 


28 


b sncjpause 


1= ^-^^^TH"?^"?. B* h 


29 


b icjpause 


1= •fr^TS'J'?^^ BXF!) -A^ff 
it 


30 


b autojjrime 


1= B* h y — J±<Dis— ^>*#^5cTS<J9 



m. i s stat 



xOOOOOOO o 



3R1.14 ic stat I'^X^-f-yl'K 



7 4 
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[3:0] 


ec_state [3: 
0] 


6= * <d mftm Ktmr ? ± * t <o m 


8 


overlap 




9 


ic interrup 
t 




10 


jump 




11 


prefetched 


vA» iA^_ >6w Jlj Al— IT?-. In w— » > — /J^ Tt. A- 1 _ 










12 


stream 


1=* h y-j^B 



-153- 



13 


pref_stream 


l=* h V — J» B i 


14 


condition 


_ >tr Sal. • * ■ « -r - r r . t _ % » o 


16 


a_wait 


o=a * h y—J^frMlftMWP 

1=A * M;— <Q&St 


17 


a_async 


i=a* h y-AT^^o^^iifTf 

1= Asynchronous transfer 
in progttss in stRttrn A 


18 


a_busy 


0= A^. h ]) -A.fr k V— 
1= A* h y — A^Ifv?— 


19 


a sem 


A ^ b y — .A ix^x^wir-v^^-^^— ^ 


20 


a_lock 


0=A^ F y — y9 £3xTW£V^ 
1=A* h y— A#Sn &*Vf^<5 


21 


a primed 
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22 


a paused 


A -X h i) —A^ih 

o=a* h y-Atf*^7HW«9a*"tM*ifcL 
1=A * h i) fJ t> iA^-C#lh 0 


23 


a_ol_primed 




24 


b_wait 


o=B* hy-AdSii^fTff 

1= B* hl)-Ai5tT7tl:J; t^lt 


25 


b_async 


0= b * h y - j*x-ftmm<D&mm'<7 * 

1— Asynchronous transfer 
in progttss in stRttrn A 


26 


b busy 


1=13* h y-^#tf^- 



7 W — ^ K 






27 


b_sem 


l=-fc v 7 * jWgflT'&ft £ KX \ ^ 
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23 


b_JLock 


1 = B* h9—J»tftiy?£iiX^Z> 


29 


b_p rimed 




30 


b__paused 


B* h J^it 

0=B* h U-A^j.7-tiJ«9iA^r'#lh 

1=B* J»tf^y— « t> iA^Tfteii: 
„ ayBtyt^ 1 ^rS^tPi i {I J: 
tr 


31 


b_ol_prime 
d 


0=B* hy~A<7) 

l— JdJa hy "-av; M fetw it V — r? Zs 7* 



i c_e r r int 3d»Sr*-f", T^rw^ • ^W77^*^tf. *■ 

<z>W**i* i crtttTWO ut 3n#ti<o\:y Mii**# iitfiiT^ yr^tvs. 



16 1.15 ic err int l^^??^^ K 









8 


a ill err 


A * h !J— A3pjE*^7 — 


16 


b ill err 




24 


a snc int 




25 


a ic int 
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26 


a^sns int 




27 


a_is_int 




28 


b snc int 




29 


b ic int 




30 


b_sns int 




Jl 


b__i 3_int 





o. ic err__int en 



1.16 ic err int en V^^7^-/VK 







mm 


8 


a ill en 




16 


b ill en 


B * h y-A^iE^^^y — f*-:/^ 


21 


a_suc_int_e 
n 


a * HJ - A "iz-^y^f^T" »Jii^ 


25 


a ic int en 


a * h> - a "***t" «a*-f 


26 


a_sns_int 




27 


a_i s_int 




28 


b_snc_int 


b* m-a "^-^^s^r'aia* 


29 


b ic int 





30 


b_sns_int 




31 

i [) a 1 


b_ls_int 





fc5<k u-co 

fc{g^$tV5 0 iCDU-^^^cDlJ-fe-/ hfltteO x 0 0 0 

ooo o o-efc5o 
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1,17 ic ipa U*?xfry 4~ 



q. i c_ t d a 



y 4 —>v K 






[31:0] 


lpa(31:2] 





^yt^/ MlteO x 0 0 0 0 0 0 0 0*Cfc5 o 
*1.18 ic tda t^**:7^— /UK 



r . i c f n a 



7 4 K 






[31:0] 


tpa[31:0] 


A* hV— A"todo"tt 



UlteO xOOOOOOO 0T'fc5„ 
&1.19 ic fna W-v>*^7^ — /UK 



7 4 ->V K 






[31:0] 


fna[31:0] 





s . i c i n t a 



0x0000000 0X*h% o 



1.20 ic inta l"isX?7 j — ;I/K 



t . i c 1 o a 



7 -f -/u K 




mm 


[31:0] 


inta[31i0) 





ft 1.21 ic loa Ui?X?7j~ /PK 









[31:0] 


loa[31:0) 


A* by -A "S^S^^v'- 



u. i c i p b 



0UH££$tl$ o :^)l/^^0!Jt5/ MIteO x 0 0 
00000 0tfe5 o 



311.22 ic ipb \sV*97 4—AsY 



7 -tV K 




mm 


[31:0] 


ipb[31:2] 





v. i c_tdp 

:©l/^?|j:7H)-AB« "to do' 



140x00 00000 0-Cfc5, 
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gtl«23 ictdp l^^^7^-/VK 









(31:0] 


tdb[31:0J 


B* h!/^"todo ,T M 



w. i c f n b 



Mifio x o o o o o o ootfc5 0 



1.24 ic fnb U i?X $7 4 —fc K 



x . i c i n t b 

Z<Dlsi?X#teX b })~J*B<D TfiJtJ^j 



7j-;ub* 




mm 


[31:0] 


fnb[31:0) 





0x000000 OOtWo 



1.25 ic intb y 4 —/W 



y . i c lob 

^3 2b*5/h (yyt°>?) (D^-^^X^Zteft 



y j ->v k 






[31:0] 


intb[31: 0]' 





Otfe5 0 



1.26 ic lob \sy*#7 <i~jVY 







islw 


(31:0] 


lob[31:0] 





z . 1 c s e m a 

a a . ic semb 



Z.(DUyx?\Z i c __s t a t ^^^COgfHtfflSrffl^ 
fc^yT*T*fc!9 x r^U^^co^^tt^ bV 

a b . i i s c f g 
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.A 



ifil.27 iis_cfg 



7 w K 




mm 


[31:14] 




Reserved 


[31: 12] 


po p 


po©M (o=fti£ N i=ftj«) 


[11:10] 


dec p 


DCC?>«;g 1=M) 


[9:8] 


icp 




[7:4] 




Reserved 


[3] 


S 




[2] 


B 




[1] 


E 


0=7* -f ;xjq— -7Vu 



ac. iis stat 
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£1.28 lis stat V*?**7 4— /UK 





m 


Wffi 


[31 : 25] 




Reserved 


[24] 


i prefetch_po 


i_pre fetch poim^^^T— b 


[23] 


i prefetch dec 


i prefetch dec it^O^^r— b 


[22] 


ijjre fetch ic 


i_pre fetch IciM^r^^-T' — h 


[21:20] 


lmc_po_lpri 


imc m<o po mm<Dmft&<D^ 


[19 : 18] 


lmc_dcc_lpri 


imc m<o dcc mm<om9£M<D5iW 


[17 : 16] 


liuc_ic_lpri 


imc m<n ic m&<Dm?em<D&w 


[15 : 14] 


eic_po_lpri 


eic m<D po mm<Dm9c&o>tm 


[13 : 12] 


eic dec lpri 


eic m<o dcc m*<«fc&<a£# 


[11 : 10] 


eic ic lpri 


ElC/fitf) IC &&0>&c&<D$m 


9 


lmcjgnt po. 


1=P0 # imc <d$1»rI 


8 


lmc_gnt_dcc 


1=DCC # IMC <73 SjWFRT 


7 


lmc_jjnt_ic 


1=IC # IMC^IJM^J 


6 


eic gntjDO 


1=P0 7)5 EIC <DM$)flFfi]' 


5 


eic gnt dec 


1=DCC # LEIC <DfflW?°I 


4 


eic gnt ic 


^IC^EIC^J^pT 


3 


valid_req_po 


l^po^i 1 ? iisWEU^Wft;* 


2 


va lid_reqjdcc 




1 


valid req_ic 


l=ic t'<fc «Q iisfcJEUS53fc& $ *$ 
titc 


0 


i source 


li source is^-^t^Sl 



ad. iis err int 
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^1.29 iis err int \si?X#7 4— 



7 4 —IV K 


£^ 




[31: 16] 


interrupt 




[15: 0] 


error 


tr^ K 3 =x -c us K i 

tT y • h 2 =^£&& f- oMitM 
\f y h 1 =^&te%:^7°wt yW&W. 



s e r r in t e n 

^ 1.30 iis_err_int en W^^7^f-/UK 



(try h) 




film 


[31: 16] 


int mask 




[15:0] 


err_mask 


— 

fc'y h 3 =r 4 xx-— us £ I 



s ic addr 
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iis ic addr ViP^ft? H — 7U K 



a g. 



iis dec 







mm 


[31:0] 

q d d r 


ic_addr 





^1.32 iis dec addr \s i?X #7 <i *—A* K 



a h. 



l l s p o. 







i&9J 


[31:0] 
J ddr 


dcc_addr 


^4 dcc l:J:oT5#Sfift 



a i 



i i s bur 



^1.33 iis_po addr U-^^^^—^K 



7 -< — /u K 






{31:0] 
J t 


po_a ddr 





iis burst U^^7^-^K 



a j 



i i 



[Xv b) 




mm 


[13:0] 




Reserved 


[21: 16] 


po burst 


PO t)>tb<D/<— 7 hjf 


[13:8] 


dcc_burst 


DCC J&»&0V*— 7 hft 


£5:01 


ic burst 

r — J 


IC i*b4>'<—X Nft 



1.35 iis base addr kS?X^ 7 -f —A' F 



7 — ^ K 




mm 


[31:12] 


the 

coprocessor _base 
[19:0] 




[11:0] 




Reserved 



a k. 



iis test 
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*l-3€ iis test 



7 4 —sis K 
(tV h) 






[31:4] 


test mode 1 3 ! 0) 


Reserved 


13:0] 




0=IC->EIC, PO->LMC 
0=IC->EIC, DCC->LMC 
0=PO->EIC, IC->LMC 
0=PO->EIC, DCC->LMC 
0=DCC->EIC, IC->LMC 



J I 0=DCO>EIC, PQ->LMC 

fl^f y^7x^ny|>D-7 UiSx? al. e i c__c f g 

&1.37 eic_cfg U^^7^f-/PK 



(bits) 




shah 


U 


reserved 




1 


enable 


£T<£Eicc9$iffc£: v pci k$\JB'<x(D 


2 


bypass 




3 


reserved 


eic -7W Kco 1 o|fefi^n^± 


5 : 4 


pci_arb 


PCI /^COifcg 

01 = IBus<S?fc 
10=RBusfiE5t 


6 


pc i_ibus_mLn_l 
en 




7 


pc i_rbu3_mln_l 
en 




8 


pci_mrl_mrm_en 


pci **yt0KfflL9^ 9-K 


10 : 9 


gen arb 




11 


genjclk 


0=8UB^^n y ^tclk^) 1/4 
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21 : 12 


irmn mask 




29 : 24 


hash_func 


9 J>T—'?/i>/i>y ^ 7 -7 y-e{£ffl £*i 


30 


hash bypass 


U^-v^-^T^ib 1 3 fy h 


31 


mmu bypass 





am. eic stat 

^1.38 eicstat /V K 





m 


m 


(bits) 






0 


ibus_pci_act 


Ibus PCI '<*^<nT$*T4 7HS5ft& 


1 


j±)us_pci_act 


Rbus PCI '<X^cOT:?^:/5?jfc£; 


2 


mnu act 




3 


gen_act 




4 


ibusjpci_gnt 


ibus ffi pci /^(rffpT$ix5 


5 


rbus_pci__gnt 


Rbus jfi pci /-WcSfpTSixS 


6 


irmu_pci_gnt 




31:4 


reserved 




i n t 




£ c PCl3y7^ai/-; 



e i c_e r r_i n t (D^7~-RTf&l<0 f— 5 J. y — t* y M3 P C I > 3 

WJ;oT!)ty hT-#S„ ii^fi0^y-S.OfSiJ9)i^ V\ e i c_e r r_i n t T'COa tf-ttflj 

try htt?Otfy MCI fey -fey h3H t>*»U*V\ 
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£1.39 eic err int W^j^f^vUK 



(bits) 


m 


BBS 


0 


page_fault 


<<~&7*—. Y^y -t-J: V-ty h&ti 


1 


prot Eault 




2 


<jen_err 


< <!r 1 1 OCO genelk ^/WTn1 s 


3 






4 


iivd_e tr 


?V ^-^/V^^^tETC ^ Rhus $ 
Illbu^Jfc W p b tfi 0 (75 ir S 


5 


ta rqe u_<ni>oL L_e* * 


pci t*^^~;*lcteVvt^ofc 
Targe tAbort try ^ 


6 


cfciate i;_aDort_er r 


PCI |^^<-^iCA>V^Ttil/J^tlfc 
TaxgetAbort h^r^fc*— 


7 






8 


da ta_paxl ty_exp 


pci tr*>v vr^ii L/^r- ? 


9 




'pt£<kt>l '><D genelk *;WBk 
gen_cxpl VWOiXhZMblSy hZtl 
*> 



23:10 


reserved 




24 


gen int 


^<it)10(D gen_clk iNf * / Wlffl. 
genintl t'MO&^nS fc^y hSft 


31:25 


reserved 





c e r r in t e n 
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eic err int \s&?*9 y 4 — iV K 







mm 


0 


page fault 


ttfci~Z>^y - My b %-y*Mz. LX 
c err ^irHifr-^S 


1 


prot_fault 


mfc*Z>^y-My b*<4 *-7Mz.LX 
c err ^rfch^Ji ~<5 


2 


gen err 


*j-ffri-5^7 - h £>r lx 

c err fctftfjtS 


3 


gen_addr__err 


ttfc-tZ^y-My h%-(*-7MCLX 
c err ^Hi^-f 3 


4 


rwd_err 


fti&tz>=>7 - My b &4 LX 
c err ^attrf £ 


5 


target_abort 
err 


*TJ£-f 5^ 7- e*s> h ^r-f *—zfMz LX 
c err &Htfrf-5 


6 


master abort 
_err 


ttf&tZ* 7 - m y V %—y*Mz LX 
c err Srffl^Ji"5 


7 


reserved 




8 


data_priorit 
yexp 


ttfctZ^-y - My V %--/Mz. LX 
c exp %H5±)1rZ> 


9 


gen_exp 


ttfci-Z^y- My b *-y*ML LX 

c exp Srtftfrt^ 



23:10 


reserved 




24 


gen_int 


c int 


31:25 


reserved 





ap, eic test 

*1.41 eic test V&7s#y 4*—AsY 



aq, e i c p o b 



{My b) 




mm 


31:0 


mm 
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g£ 1 . 42 eic_pob U^* » 7 4 ~/U K 



(try*) 




mm 


0 


gen _pobO 




1 


gen_pobl 


>-publ 



a r . e i c_h i gh addr 



$1.43 eic dual_addr K 



(try H 




m 


31:0 


high addr 


*7vir*, f-f^*ffBft4«Sfe LTV 



as. e i c w t 1 b v 



1.44 eic wtlb v ^^7^-^ K 









1:0 


pt_ flags 


#&*;ix£ TLB h V -CMS L 


11:2 


reserved 




31:12 


vpn 
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a u . 







3S3 


0 


inv_all 




1 


±nv_entry 


5&K£;ft$o TLBtoiM^Sv Eicfircohr^ 
yr-rs 


2 


wrlte_lru 


h T? LRU {£BI<I TLB^y h y -£S&&TLB <7>Sb 

fftg, eic li^co tr y h Sr^ y r-f £ 


3 


write 


hy-Sr^iitfo TLB<Dmm&. Eictt^ev f 

^ry y /To 


7 : 4 


ent num 




31 : 12 
1 v L 







£: ^(Dl^^X^CDffifi. MMU^-^7i-/Hi T&V^ib, V^O"Ct>^}E ^Ttg-CfeSc 

1.46 eic_mmu_v y 4— jvy 







' 


3 : 0 


mmu lru 




4 


wxu hit 


&&<D tlb T^ir^^—^x— 
it<5 


5 


mmu_retry_h 

it 


tlb T ^i?* 
(D^Tt 5/ £ > v_mniu_retry_hi t tf* 


11 : 6 


Reserved 




31 : 12 


nmu_lvpn 





a v. e i c mmu p 



a w. 



e 1 c i p a 



^1.47 eic mmu p U^^7 4 — A* K 



31 :0 
I d d r 



mmu lpta 
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^1.48 eic_ip_addr U $ 7 4 — K 









31 : 0 


libpa 


PCibus fcr * ir^^ tot £>fti5 



ax. eic rp addr 

ft: :oi/^^<^ittRBRMBu sHPCI/^ 



a y . 



eic i g a d 



^1.49 eic rp addr V*J?s $7 4 —/V K 









31 : 0 

d r QL i £ CO 


lrbpa 


PCibus f^Tt^ fc> o t fcS; 



^ 1 .50 eic ig addr # 7 4 — K 



az. eic^rg a ddr 



31 : o 



irbga 







mm 









31 : 0 


itbga 











b a . 1 m i c f g 
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0:1.52 lmi_cf g U $ y ^ f? 



7^— /UK 
(fcV b) 




m 


[31 :24] 


ref interval 




[23 : 22] 


ro_prio 




[21 :20] 


pic_prio 


PIC Tt±*mft& (3=*«) 


[19 : 18] 


ic prio 




[17 : 16] 


iis prio 


us TP^^naSL (3=*S5) 


[15 : 13] 


rearb interva 
i 


KtfeitSSWlWWBB (2 ~n K) 


[12] 


mem_enable 




[11] 


banks 


ffi^tLTV^5DRA^^(^R(0-l # 1=2) 


[10] 


iis_fiush 


) 


[93 


iisj>refetch 


iis<Q$fcJ&!?^*-:7Vu 


[8] 


refresh en 


DRflM <DftW}}) 7^ */^4%—7i\> (CBR) 


[V] 


seamless 


jffiS©*v^ sdiwm i?y*~- fV Y4%— 


[6] 




Rservcd 




[5:4] 


bypass mode 


/V/^^e-Ki^: 1 

00=Rbus 
01=Abus IfiZ^ 
I0=Abus y~K 
ll=Ibus 


[3] 


S 




[2] 


B 




[1] 


E 




[0] 




Reserved 
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£9ttP1r? hfltO x 2 0 0 0 0 1 0 O$rt>o„ -f^T ^ ( E = o) T*fc5c !)7Uy^|iElfy H:jKf 
^•C©T^-fe^«||Stt<i!U< OKKjeStLS. y7Uy bb. 1 m i _ s t a t 



3E1.53 lmi stat 







m 


[31] 


ro ca 




[30] 


pic_ca 


pic ** : 9^tVfy't4iVT9*r4-f 


[29] 




reserved 


[28] 


iis_ca 


IIS *S?3L— A^M 9!V79T<iZ? 


[27] 


ro_cp 




[26] 


pic cp 


pic ^v^^Mf-i' ^/i/^yf^ 


[25] 




reserved 



[24] 


iis_cp 


IIS *ei?**-/U$-'{ p/is^yy^j l/ff 


(23 : 22] 




Reserved 


121 : 16] 


fltateA 




[15 : 13] 




Reserved 


[13:8] 


stateB 


ha— /U*^— hB 


[7] 


rd active 




16] 


wr active 




[5] 


rf active 




[4] 


rf jjending 




[3] 


iis_pre_active 




(2:1] 




Reserved 


[0] 


sdrarm 


sdrarm 1 A^JtVcO^ 



i/yt±CBus-fy^-7j-^2g^ny^tI be. lmi_err_int 
■l $ tvtfc 0 , ft o t ftfr CO 8 0 MH z £ o y ^ 2 
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^1.54 lmi err int # y 4 —/V K 





m- 


mm 


(31 : 24] 


interrupt 




[23 : 8] 


exception 




[7:0] 


error 





St 1.55 filtiii^f v b<Dfe& 





m 


24 


vy u y 5/ fro #!)7UyViHII (6 
4ms) 


25 





1.56 — fcfyh^Sl 



:©^?ttjty Hg b x o o 



DRAMAS -Y * — ^ /U "C 4 B$ fc DRAMfc: T £ ir 



Rbus 7K^X7- 



Cbus T Kl^^y- 



b d. lmi err int_en U^yTs $ 



igl.57 lmi err int en U i/A $7 4 K 









[31 : 24] 


int mask 




[23 : 8] 


cxp mask 




[7:0J 


erx__mask 





T'#5„ :©l/^^|ilmi_err_i 

Uv^:?©tf y htlmi_err_int l^v?*^© 
r r_i n t_e n l^*;?©^©!:*;/ MWftCft 



ofck 1 m i_e r r_i n t U-^^^gtSfy 

A^5— * W*&$V^I±tt9&#f&f\ c_e r 
r, c_exp, fe§VMic_i n t^f4T$5, 
U 1 m i_e r r _ i n t_e n lsiSx#<D%ffe<D tf y 

9T£tl1thtcb lmi_err_int 
9<T>fttetZ>\Zy h£S*afc&9, c_err. c_e 
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^^Hy Hitter ©a: 9 -&tfW 9 ii*JR be. lmi_dcfg 
£i§#ltC-fS 0x0000000 OTfcS. ttffl**i<e 



1.58 lmi dcfg ^ 7 -C — /U K 



(try W 




mm 


[31 : 29] 


row bits 


DRAM (D^f ( row) T K k"y f>|£ ( n +6) 


[28 : 26] 


column_bits 


DRAM <£>?IJ( column) 7 KU^tf-y ( n + 6) 


[25] 


edo dram 


EDO DRAM (0=FP, 1=ED0) 


[24 :22] 


precharge e 
nable 


T'lif-l'-^WiT K^t> h (n+5) 


[21 : 19] 


prechargejD 
ankl 


'/V ^-i? • '^WT K tfV h (n+5) 


[18 : 17] 


cas lo 


CAS CDfg;^ n $* ^ f'^ >7 (CAS x^/U^i|ig 
) (1 Jhm 4, o=l) 


[16] 


cas_hi 


CASCiS^tay^f-^^/i^ (CAS tf>7Vf- 
(1 2, o=l) 




[15 : 13] 


ras_lo 


RAS©Sr/htt:J r ny^-!M'^/u (175^ 8, 
o=i) 




[12 : 11] 


ras hi 


^+-v>) (1753* 4, o=l) 


[10:9] 


r as cas 


i> 13 S^lNf ^/HLfcttS RAS CAS ^(Dil 
ft (2-6, 0=2) 


[8:7] 


cas 1 at 


* a j/ ^ IM" /HC&tfS SDRAM 
CAS latency (?t&) : 

0, 1=1^D^ 
3=3 Po-yir 


[6:0] 




reserved 



*ftfflr*»fr©f-^X^3i/7-f ^aU-S/g^Sr* HIOxOOO 7 f f 8 

*r*ISW'<5>< Z<DUiSx?i$t bf. lmi_mode 
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8rl,59 lmi mode 



y 4 — /u K 
H 




mm 


[31 : 14] 




Reserved 


[13] 


initilize 


SDRAM <?>V)fflik {7u puis 


[12 - 0] 


sdram mode; 


SDRAM ^ — Kfcf *y K fcnRZiM dvV K*^>*Ttffi 



fctf^-fS, rov^^ji^fcK^f prut?, 

itV-y hairs' h-TSr <tirj;oTSDRAMtC##i2, 
A/T-fc&VX, ^<DUi/x?}-±}) -(ry HlO x 0 0 3 7£ 



-?&*U* % SDR AM*- K * ^WfMzrf n :/ 
bg. pic_cfg U-^^ 



1.60 pic cfg ^^7-f-/VK 





45* 


mm 


0 


reserved 




l 


enable 




2 


bypass 




3 


res erved 




4 


big endian 


big_endian Wh, ^» fcf 
y h 3 i:24T\ i(it'>7 h 7:0 


5 


video mode 




7:6 


reserved 




8 


gate_out_c 
lock 


vclk en in LTSteS-fr-S 


9 


inv out cl 

ock | 


\?T K-Cffiffl, tftfj* n j/ * £ 
vclk inKitLT/x$5$it<5 
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10 


use defau 
Itjdata 


y hZtlX^&k, defalt_dataUv*;** 


11 


default en 
_source 


hfx^ffl^J^— KT'tffl, <b Luse_defa 
ult_data;6W#— •fA'-ttblMis defaul 


12 


data_en so 
urce 


t^ffi^-' KTffflo IOif, vdat 
a_en_in_ld$iJ-y://U£tL, HJ^StW 
5 t#vdata_en_out_l^ s tt}^3$tl/-5o — 

* tLX 1/^5^ vdata_en_out_l JJ s tB^3 £ 


13 


del_data 


fc'T^to^J*— KT«fl 0 vdata_en_ou 

t_iteH LrtttfjT*-* * i * p y ^tM 


14 


ga te_with_ 
rdy 


rdy l^htS 


15 


reserved 




16 


forward_ac 
t_dis 


JiJ^FIFOOx— 9c_active<DtK^J 


17 


reverse ac 
t dis 


A^FIPOWT'^fcJ; t)c_active<Z>ttt;*J 


18 


forwarder 
io rity 
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19 


reverse_pr 
iority 




20 


forward_i 
nvert_dat 
a 




21 


reverse_in 
vert data 




22 


fonward_ i 
nvertjcont 
rol 




23 


reverse_in 
vert_ cent 
rol 




30 : 24 


cbus_ timer 


AjMfcmj] : r~?x h})~-J* cbus 9iv 


31 


cbu5_tixrer 
disable 





c s t a t 



^1.61 pic_stat V*J>7,$7 4— iVY 



7 4— IVY 




saw 


0 


abus act 


ABus <Dby >J v- g ^^-rV >- ^ 


1 


ribus_act 


RBus <D h 7 Xlf 9 v a isfr^<lsf4 y# 


2 


reverse_act 
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3 


cent_cfg 


Woent cfg tVOT^t- 


4 


forward full 




5 


f o rwa rd_s ubwor 
d 


PIC©tt|^ETro^0J^±4^W b&Cb~<DX 


6 


f orward_emp 

ty 


PIC <D\&?) FIFO ti^<D#f\Z.-t >y b 


7 


reserved 




8 


reverse full 


PIC ©A^ FIFO i$7/l>>(DV$f\Z.-k y b 


9 


reverse enpty 


PIC <Dtiif} FIFO t)^(Dm Id* ? F 


10 


erwrsej ubwor 
d 


PICCDA^JrTP0{C0^±4^<-f hOT^ 


15 : 11 


reserved 




16 


picjjlh 


pic plh fcVOM 


17 


pic_ack_l j 


pic ack 1 tfV©:Kflg 


18 


pic_busy 


pic busy ¥><DVm 


19 


pic perror 


pic perror fcVQtRfi 


20 


pic select 


pic select t>®tl j 


21 


pic_fault_l 


pic fault 1 t >-<D^^ 


31:24 


pic_data_in 


pic data in fcVO^S 



bi. pic_err_int ©^-{C «£oT U -fey h £*t3„ ZtlZfttD \fy Hi 1 £ 

P i c_e r r_i n t l'i?X?(D*y-$S£V%\')& »#atrr. & "C y -fe s> h ZtlZ 



^ 1 . 62 pic err int VS*.* * 7 ^ — /l- K 
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0 


f orwad_err 




1 


rwd_err 


7^z-^i7- OftrCOgjfc, PIC #7* 


2 


timeout err 


mm* 4 ATP 


8 


reverse exp 




9 


cbus_exp 


^5, K^s-fe y k k , Airily- 

V - -A© CBus "tr-Y P/H)^b IsXWMZ 


24 


rev_ccnp_iiiL 




25 

■ 


rev_daUa_ii i u 




26 


abus_corrp_in 
t 




27 


timer int 


* Y t Om* 1 (DIMWIT LfcH#{C-fe v V 


28 


conp_£aul t_i 
nt 


T'pic_fault_L m %-i>mtl Ztltz £££ 
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29 


cornp_perror 
int 


pic_perror « ^tti^ Sttfc fc £ K 


30 


ecp_rev req_ 
int 


t>hn^^7 ECP KT^iS^)^^ 


31 


no_periphera 
l_int 


fit i vi r4-y» f N *t — # V . » 



c_e r r i n t__en 



& 1 . 63 pic_err_int en £ 7 K 







an 


0 


forwardjerr 


Tc intSrW^7^« 


l 


rvrijtrr 


Tc int£tftfj£-££ 


2 


timeout_err 


Tc int££Hft$i*:5 


8 


reverscexp 


Tc int*m*s-fra 


9 


<±>us_exp 




r 

24 


rev camp int 


Tc intSrtffc/j£i^5 


25 


rev_data_xnt 


Tc int£ffi^£it£ 


26 


ab us__oarp_in t 


Tcint*UW£*fi 
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27 


timer int 


Tc int£tlttj$ii:3 


28 


conp_f aul t_int 


Tc int 


29 


coup per r or in 
t 


Tc int £ffi#£i£5 


30 


ecp_r ev_r eq_in 
t 


Tc int^ffi^iirS 


31 


noperipheral 
int 


Tc intfcffl;*j£i$:5 



b 1 



b m. 



p i c a b u j 



55 1 . 64 pic_abus cfg V $7 <( K 









23 : 0 


ab_count 


tHBfi££ftZ&lc pic fc± 9 - i 


27 : 24 


ab byte en 




28 


abjtype 




29 


ab_the 

coproccs- 

sort 




H r 


reserved 





*1.65 pic_abus_addr U$?*#74"~ /H** 



31 : 0 



ab addr 



mm 



c II I U f K. 

.cent_cfg U-^^tity hnr.^xt 
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^1.66 pic_cent_cfg \si?*$y 4— /U K 



(try H 


£1* 


tftW 


2 : 0 


cenjund 


0 00 - ^ f - y ^ ^ .j^ 

001 = ~;//l^Jfc 
101=ECP^fc 

ho=ecp ^/SLTr^-Y^ I 


3 


cen_exe 




4 


cenjsts 
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7 : 5 


cen mode 


000=3 i//*f*7/i, 

ooi=-^V^^^-y/i/T'3 y^^/u 
oi o=^77u&&ffi LTt s >'W * id 4^5-5 
oii«»?Vi-*— K 

100=reserved 
lO^ECPMMij'InJ^e— K 

uo=EQ) £H£ffJ ItrM ^ id £ri£5 
iu=ecp Jfe*!^*- K 


8 


cen_direct 


v i> * ? w&^mzm&ft-r r 1 k <t 

19, y 7 h £ j-T^jiC^ctli^ tr y V* 


9 


cen_host_rec 
en 


ECP h StitM-f^-^^ 


15 : 10 1 


res erved 




23:16 


cen_tim 


/rx » . J , . . 1 _ T__ | - J -fi V V- j _ /r~i 4— t— 


31 : 24 


cen_per_tim 





bn. pic cent dir 

^1,67 pic__cent_dir i/^»7^-^P 



7^— /t^- 




m 


0 


pic_data_oe 


pic data oett^Qg&MR^!/ ^ 
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1 


pic strobe 1 


pic_strobe_l fa^iSBcM#t: y h 


2 


pic autofd_l 


pic autofd 1 {a^iffl^J#t: :/ h 


3 


pic_init_l 


pic init 1 fs^ilMP t 5/ h 


4 


pi cjs elec tio n_l 


pic_selection_l {i^jffigftlJ^ItfV h 


5 


pic buf_en_l 


pic buf en 1 ft ^|Sg$i{pfc> h 


6 


pic_bufjdir 


pic_buf_dirfe^©]l®Wffllbf y h 


7 


reserved 




15 : 8 


pic data out 


pic data out iM^^iS^rMi^It y h 


31 : 16 


reserved 





c r e v e r s e c f g 

^1.68 pic_reverse_cfg \sj?^$ 7 <j •—A* Y 



(if y h) 


m 




23 : 0 


rev count 


ts2&^tt5S(c pci j; ►> - i 2*1,3 


24 


rev_use_cou 
nt 




25 


rev_the 

coprocesa 
ort 


= y a -t? y iHc «fc 5 7* o 7 A * ftfc i¥ 
#l"l35j&> rev_use_count f y h >y 


31 : 26 


reserved 





c t imerO 



H 1.69 pictimerO J—lV K 









31 : 0 


timerO 





c t i m e r l 
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pic_timerl l/^»7^-/UK 









31 : 0 


tiaerl 


& #J GO 7* — ^^bx^^^tf)*^ A 
7!> htO?>fTf CI ^ n y # <D 



7^^y^3yhP-7^W bs. dcc__cfg2 

br. dec cfgl 



&1.71 dec cfgl \s*J>?s $7 <j — As K 









l 


E 




2 


B 


0=s< /< ;* L ft ^ 
1=/M /^f5 



dcc_cfg2 V^^7^wl-K 
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[10:8] 



Autoinv 



Autofill 



Lock 



t 



ss 



mode 



hRffGOlig) 

1-7^-^1 >o<b£$re&o C^rV -yv^-^oTV 



) 

1=JPEG ftrHt 



6=64fc^ HE— H 
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cache_mis 






s_inst 
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bt. dcc_stat r 

*1.73 dec stat " 



bv. dec err 









[3:0] 


error [3:0] 




[p:qJ 


state [7:0] 




_int 

^ 1 .74 dec err 






«* 


mm 


0 

i n t e- 


error 

h »- 





^1,75 dec err int en K 



(fc's>h) 




m 


0 


eriot en 





bw. dec lvO 

8c 1.76 dec lvO U $ 7 <i —iV K 



(try h) 




mm 


[31:0] 


lvol31:0] 





bx. dec lvl 

*1.77 dcclvl 9 74— A>Y 



(try » 


43* 




[31:0] 


lvl [31:0] 




$1.78 dec lv2 U^X*?^— A*K 




m 




[31:0] 


lv2[31:0] 





bz. dec lv3 
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gll.79 dec lv3 U^^7 J —A* K 









[31:0] 


lv3(31:0] 


2 7*>P>9 64Wft»fcrs/ h 



c a . dec addr 



3ft 1. BO dcc_addr US^;* * ;7 ^ ^ A' K 



cc. dec rad 









[31:0] 


decjiddr 


miter vuy*swMr- Kr«fflSti^<-^ 


drb 




dcc_raddrb 


7-f-vuK 




m 


[31:0] 


dec .raddxb 




d r c 








S 1.82 


dec raddrc l^S?* 9 ? <i — K 












131:0] 


dec raddrc 


> Vfr-jfai-Fcoyr Kv^wKH 





cd, dec test 



* 1.83 dcc_test l^*»:7«f— K 



74— /UK 


m 


m \ 


0 


drive_odata 


testjaode ¥y b&0\cty r£*Vtl/^t#(0 


1 


o__ackb 


testraode 0 h^0(^ir-y h*ft"tl 

DCC He— KTdrivejDdata 

1-1 **<7»1 o ackb £ji#rf 5 
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o ackc 



h^9k CCC y^T* h^E— FT' drive jodata & 
1= 1 ^ p o ackc ^IJfc'Tf 5 



[5:3] testjrcde O=odata /^^E— F»5 (!Ji?y M^ptDfiD 

1=<XB 7 KV*£ IIS i_ce=0 
2O0C 7 FV*£ IIS KtftfX i_oe=0 
3=IIS 7*-*$: o_datab[31 :0] (OtWs i_pe=l 
3=IIS7^^^ro datac[31:0]&:#ift i oe=l , 



^7 v 



ce. oon cfg (oob cfg = 0x70, 

ooc_cfg = 0x8 0) 



1.84 oon cfg Uifxpy <j— /VK 



(tV h) 






[31:9] 


operate 


reserved 


[81 






[7:2] 




reserved 


[L] 


E 


i=^^-^>i^ 


[0] 




reserved 


t ( o o b_ 
x 8 1) 


c f g = 0 x 7 


^1.85 oon_stat V 


O^y M 






[31:24] 


state(7:0] 


reserved 


[23:16] 




\fv b 1=00* WU KW* 
) 

fcfo,h2=Firc)3£ 






[15:0] 




reserved 
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eg. oon__err_int (oob_er r_i nt=0x72, e r r i n t = 0 x 8 2 ) 

^1.86 oon err int \s*J?*&*7 4 —As K 



/UK 




S8PJ 




error 


try K)=r -< ^.^-^u^'OOirj; 9 O Bus g 

\fy v i=«dp t jpeg <mfm&r~?*m$mi 



ch. oon err int en (oob er 

r int_en = 0x73, err__int__en = 



0x83) 

^1.87 oon err int en ^?7^^K 









[15:0] 


error 


try h o=f*-f xx— y/U^^cOOt J: 9 OBus 

saw**** 

f y f- 1=MDP t JPEG#§^^*7 , -^*1^ei 



c i, oon dmr (oob dmr=0x74, 

ooc dmr=0x84) 



^1.88 oon drar ^^7^-/VK 
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m 




[31 : 30] 




i=/^H^b 


[29 : 28] 


ls2 


'Vb2£>l^V*7*y7'': 
0=;<Y hOrfpfe 

3-v<0 3ri*fc 


[27 : 26] | 


lsl 




[25:24] 


lsfl 


3=/^ h 3 


[23 : 20} 


s.iieii[3:0] 


l=oon_subst \Zffl&£titL{\$i$: Z(D'< 


[19 : 15] 


replicate 
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[14 : 12] 


norma liae 


o=r***%it.9 1 e<? b 

l = ff^fct) 2 tTy h 
2=^^*/uSfc9 4 If y b 
3 = f^/USfc* 8 tf y b 
4~7=^-r*/l^;fc9 1 6 fcT-y h 


[11] 
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[10:8] 


bo [2:0] 


*|*try b^W bJftO&y bJTT-ty b 


[7] 


P 


$bU7*~^V b : 
l=W^^x^ bffrWZftt&b 


[6:5] 


if 


ftU?*— v b > 
o=B$ 


[4:3] 


CC 


JJif-r&irt&yb • 
0=4 T?T4 7? J r*/i' 
1=1 TP^n? 1 ?*?^ 
2=2 Ttt*r<i*f*f'**>\' 
3=3 79T4~??-**fV 


[2] 


L 


0=oo_said \s&X>9 9 
l=ffs( \s9bT b'Uyz/^y 




[i:0] 
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c j . o 6 


n subst 


(oob sub s t = 


0 


ooc__cdp = 0x86) 




x 7 5 , o o 


c subst 


= 0x8 5) 




* 1.90 


oon cdp \s*/*9? <{*~ /UK 


Si. 89 oon subst \si?*97 4*~ /UK 
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[31:0] 


subst ! 
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~7 y — A, IS 
y -1 /*^r 

&y b) 






[31:0] 


len 





cm. oon_said ( o o b__ s a i d = 0 x 7 8, o o c _ s a i d = 0 x 8 8 ) 

^1.92 oon said /UK 



/uK 




m 


[31:0] 
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cn. o o n__ t i 1 e (oob_t i 1 e=0x7 

9, o o c_t i 1 e = 0 x 8 9) * 1.93 oon_tile U^?7^^K 



y><— /uK 




tS$8 


[31:16] 


offoet[15;0J 




[15:0] 


tlle_len[15:0] 


m v >ywm>*^7> hem* 

5ftc? (length) -1 
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74- /UK 




m 
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[22 : 21] 


dnan 
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2=3 fHr^VU 
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[19] 


F 
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cp. p o_ s tat 
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S 
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i=sa^h^-K 
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4# 


CM 


[31 : 27] 
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[25] 


po_stall 
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[24] 
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[20] 
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[19] 
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[173 
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[16] 
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1.96 po_err_int 
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gfc 1 . 97 po_err_int_en V #7 4*-/^ K 
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am 
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[19:15] 
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[14;12] 


norznfacto 
rI2:0] 


^-nv^^v 1 6 tr-/ h 


[11] 
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[10: 8) 


bo[2:0] 


hTKUyvC/^big endian 


[7] 


P 


i =Mrt ^ <yf &ntz* h y -A 


[6:5] 


if[liO] 


0=IB@i 

l ^<y*&lvCV>fcVvV h 


[4:3] 


cc(l:0] 


0=4 T^ir^^-r^U 
1=1 Tt^ritt^lV 
2=2 7^r^/ft^ 


[2] 


L 


0=^7>K (short h) 
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[1:0] 
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i=MSfc 



c t . p o s u b s t 
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(try h) 
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31:0 
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c u. p o c d p 
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^l.lOQ po_cdp ^»7^-;wK 



c v. p o_l e n 
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31:0 


ec(p[31:0] 




$ 1.101 po len l/^#7-f-/Hf 


(tfy M i 






31:0 


len[31:0J 





cw. p o_s a i d 



^1,102 gosaid Vi>7*j!y 4— )\s K 



ex. p o i d r 



(try W 




saw 


31:0 


ad 





S 1.103 po_idr l/^^7^-/l/K 



c z . p o m u v 









31:0 


*ddth(31:l6] 
height [15:0} 


SWfeEBai&Jc: width 4— 1 -f-5 
WSH&nyJ l^mc height Sr- 1 1" 


a 1 i d 
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[31:0} 
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mm 


[31:113 
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S 1.106 mdpcfg U'^^7^^K 
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I 
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2 
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3 


T 


l=6E-;r.*Mrr-K 




14 
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rt^-fej/ NTs^Stfufri] dp_opl,m dp_op2 


15 


operate 




17 


w oid_hpu t 




18 


bng_km 1 
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19 


bbndgen 
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hstmctbn [ 
15 0] 
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0 


po valid 


pojraiidfSSv^cyai 


1 


po_flnal 


po final ft^v3t#§ 


2 


po stall 


po stall fH^Wi 


3 


oob valid 


oob valid ff^v^ffl 


4 


OOb_req 


octo_reqft^i73cKSg 


5 
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0=00B jy?— 

1-00B «{>"!?-7^-X^m&bV 


6 
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ooc valid ^v^tflii 


7 
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8 


ooc^pendincr 


0=OOC -f 7i- 5*JWWCL 

i=ooc ^^-^x-^^S^t) 


9 


ro_ysalld 


ro valid {f^ftS 


10 


ro final 


ro final «^aWH 


11 


ro stall 


ro stall fi^W31 


12 :13 


nat2rul_stae[l 
:0j 


fi^l^Wg: 
0=7V p/t, 


14 :15 


int_stat[l;0J 


@StgW0?«^: 
2=9-4 f/ts 2 


16 






17 : 18 


addgenj3tate[ 
1:0) 


1= po frtt^y % s V7*yf- 

2=T KU*3S£ 


19:20 


ranp ..state 


l=IWN-TOS 



dc. mdp__err int 

i-^TOtfy httOlC!) -fey h£ix5 0 Rl.108 mdp_err_int 



-199- 



(try h) 






7:0 


error [7 : 0] 


error [0] ^MSyiMDP ^^trt^ 
error [ 1 } =7M ;* T/I^Smdp =2^^ y*tf- 
error [2] =7^ ://Uj§BttMDP ^yot^/f 


24 : 8 


exception I 15 
:0] 


exepticn [0] =^^/W 0 irfT&gT 
exoeption f 1 ] 1 (OP 7 >~?V<nTls#— 
exceptiont{2J =?^VU2 <D?7^fX*<DT^# 
exception [3] 3 (Dp^^TV<OT>^ 
exception [ 4 3 =x MgfrXT>y—y p— 
exception [5] =yl|(Ory^7P- <fiK^& 



mdp_err_int_en t^tOt*y MiOKiy-fey h£ft£ 0 



^1,109 mdp__err int en 1/^ $y 4 — A- K 







Wffl 
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7:0 


errj»askl7:0 






] 














err_raask[0] =7Vf ^^^//l-fSteMDP dS^p-fc 












ff 






exrjnask[l] =r Xm— TVl^iMDP jd*n:/p-f? 


















errjnask[2] ==7^ y^r.<—-fjV$&-£ MDP ^ = 7"p-£ 
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expjnask[0] =^"-y^VV 0 <D# y s^?V<DTl/*f^ 
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a 1.110 mdp_tegt 





-tun* 


itSBH 


V.L- y V) 






1 * n 
o ■ U 


test_d3ital3 : 


7V* pvWFO ^JJ7^ — v^tlS 




0) 


0— 3=EO 7*--£ r fl*£>(DBQ7 flL ^ (0.1,2,3 <7>fe;y 












4=^oab_data [31 - 0] i)^h(OPO^r — & 






5=oob_data [63 • 32] 7b*t*<D RO 






6=oob_data [95 : 64] RO 7^-^ 






7=cx±>_data [127 : 96] fafyD RO 7^^ 






8— coc__data L31 • OJ ^WBOT^v 






y — OOC data Lo.3 • ZPbvJ RO 7^ — %? 






10=ooc_data &5 : 64] fabtDBQy**-? 






ll=oocdata Q27 :96] fr<b<DB07*-# 








25 : 4 
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29:26 
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3:0] 




ol: JU 
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l-V % w V^gg(iLX* P0"-KXB--C0C~TO 






2-P0, 00B, OOC fr&OT^#<Q&T : iif/<< Wl^ 
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* l.lli 


mdp_opl ^*7-{^K 




** 
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2 :o 


ppb_rrodeA[2:0] 




5:3 


ppb nodeB[2:0] 




8 : 6 


phajoxi£A[2:0] 


-K 


11 : 9 


pba_jncdeB(2:0J 


-K 


14 : 12 


pbajrodeC(2:0] 


— K 


17 :i5 


pbb_noda\[2:0] 


— K 


20 :i8 


pbb_nodeB[2:0] 


— K 


23:21 


pbbjrodsc[2:0] 


^t^bM/p vtw&mmf* yfce?e 
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27 :24 
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2=reserv©d 
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12-13=&fifc 
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28 
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1=1*81 


29 
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30 
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31 
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1=1 


3:2 


iruL_B[l:0) 


0=0 
1=1 


4 
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5 
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6 
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7 
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8 
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9 


bgjhe 
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10 


matjthe 
ocprocessort 


mdpii rt^fr^JSS^x— 

o pofinal fi-§iCfc IP Y^*^~irQ9 ]) 

rah*, (mm^mix<D^m 


11 


intthe 
ooprocessoxt 


=r7x2±y^¥yh<7mfflo ^n^ifrD 
MDPII F*W^IW^— po_final 


12 


int_size 


o=v^S*i*y<Y bifida 


13 


tnt_4 


i =4 -^co^s^^ 
) 


14 
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i=siara 


15 


lock step 


o=po, oob t ooc 

i=po, oob t ooc 4y?—y^—xti\g\ ^ 
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21 :20 


cwcoofgiCl: 
0] 




22 


fr_en 


1 =^^r//u- 


24 -23 


oobjnode[l:0 
] 


OCB A >?~7x.~XWfe*— K 

i=*R 
2=5>^ 


26 :25 


ooc_mode[l:0 
] 


ooc <Y y xifrf^p— k 

ih^ 

2=GCSC 




30:27 


trms(3 :0] 
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m^prpor: 0000 T$JMk 
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■^-V^V^ 1 <DJE^< Is— 9 Qjfr~7^ y Y 
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offset2{7;0] 


^^T^^2cDiF>r^l^ — yOtt/^zy h 


[31 : 24] 


offset3(7:0] 




[31 : 0] 


offset [31 :0] 


iE^convolution) htiBbfftftf-ty Y 



d i mdp_bi -t^XWy bitOiC]) -by h£ <OlCJ8^bft3 0 

H 1.114 mdpbi ^jg*g7^»-/kK (ffrd^E" K) 
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[7:0] 


blenderidO 


O^gtf^^i? y IMS 


[15:8] 


blendsndl 
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blendend2 




[31 :24] 


blendend3 
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m 
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blendendO 
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blendendl 
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blendend2 
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blendend3 
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m 


[3 : 0] 


rowsf3:01 
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cols[3:0] 
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(15:8) 


level [7:0] 
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[31 : 0] 
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JPEGfffMl/-^^ d 1 j c_c f g 



M 






131 :28] 


mop 




[27] 


D 


D=JPBG S5^ 
1=JPEG 


[26] 


M 




[25] 


4 


1=4^^ 


[24] 
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[23] 
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[22} 
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[14] 
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[7] 
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[6] 


Z 
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[5:4] 
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[23] 


op 
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0 
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[6] 
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1. TITLE OF THE INVENTION 
5 COMPRESSION APPARATUS AND METHOD THEREFOR 



10 



15 2. CLAIMS 

1. An apparatus for decoding blocks of data encoded using a 
plurality of variable-length code words, said blocks of data also comprising 
not-encoded fields of fixed length, said code words being interleaved with not- 
20 encoded bit fields of variable length, said apparatus comprising: 

a preprocessing logic unit for removing a plurality of not-encoded fields 
of fixed length and outputting said plurality of variable-length code words 
interleaved with said not-encoded bit fields of variable length, and outputting 
signals indicating positions of said plurality of not-encoded fields of fixed 
25 length in the blocks of data; 

means for passing said signals indicating positions to an output of the 
apparatus in a way synchronous with the data being decoded, such that if 
there is a multiplicity of variable-length coded data between said not-encoded 
fields of fixed length on the input of the apparatus, a multiplicity of 
30 corresponding decoded data appears on the output of the apparatus between 
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any two signals indicating positions corresponding to said not-encoded words 
of fixed length on the input. 

2. The apparatus according to claim 1, further comprising: 

a first processing unit comprising a first set of barrel shifters and a 
first register, wherein said first processing unit processes said outputted 
plurality of variable-length code words interleaved with said not-enoodedbit 
fields of variable length; and 

a second processing unit comprising a second set of barrel shifters and 
a second register, said second processing unit for processing said outputted 
signals indicating positions of said not-encoded words of fixed length in the 
blocks of data: 

wherein said first and second processing units are identical, and the 
output of said respective barrel shifters and said units receive identical 
control signals. 

3. The apparatus according to claim 2, wherein the output of said 
second processing unit for processing signals indicating positions of said not- 
encoded words of fixed length is used to determine the size of a not-encoded 
variable length field to be removed from the data stored in a data register for 
decoding purposes. 

4. The apparatus according to claim 1, wherein said 
preprocessing logic unit for removing said not-encoded fields of fixed length 
outputs a plurality of variable-length code words interleaved with said not- 
encoded bit fields of variable length as words of fixed length composed of bit 
fields of fixed length such that a single bit field has a corresponding tag 
indicating if the outputted field is passed or removed by said preprocessing 
unit, or passed by said preprocessing unit and following or preceding a marker 
being a not-encoded field of fixed length. 
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5. The apparatus according to claim 1, wherein said blocks of 
data are encoded using Huffman coding. 

5 6. A method of decoding blocks of data which have been encoded 

using a plurality of variable-length code words, said blocks of data comprising 
not-encoded fields of fixed length, said code words being interleaved with not- 
encoded bit fields of variable length, said method comprising the steps: 

removing said not-encoded fields of fixed length, and outputting said 
10 plurality of variable-length code words interleaved with said not-encoded bit 
fields of variable length, and outputting signals indicating positions of the 
said not-encoded words of fixed length in the blocks of data; 

passing said signals indicating positions to an output in a way 
synchronous with the data being decoded, such that if there is a multiplicity of 
15 variable-length coded data between said not-encoded fields of fixed length 
being input, a multiplicity of corresponding decoded data appears at the 
output between any two signals indicating positions corresponding to said 
not-encoded words of fixed length at the input. 

20 7. The method according to claim 6, further comprising the steps 

of: 

processing said outputted plurality of variable-length code words 
interleaved with said not-encoded bit fields of variable length using a first 
processing unit comprising a first set of barrel shifters and a first register; 
25 and 

processing said outputted signals indicating positions of said not- 
encoded words of fixed length in the blocks of data using a second processing 
unit comprising a second set of barrel shifters and a second register; 
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wherein said first and second processing units are identical, and the 
output of said respective barrel shifters and said units receive identical 
control signals. 

8. The method according to claim 7, further comprising the step 
of determining the size of a not-encoded variable length field to be removed 
from the data stored in a data register for decoding purposes dependent upon 
the output of said second processing unit for processing signals indicating 
positions of 9aid not-encoded words of fixed length. 

9. The method according to claim 6, further comprising the step 

of: 

outputting, by said preprocessing logic unit for removing said not- 
encoded fields of fixed length, a plurality of variable-length code words 
interleaved with said not-encoded bit fields of variable length as words of 
fixed length composed of bit fields of fixed length such that a single bit field 
has a corresponding tag indicating if the outputted field is passed or removed 
by said preprocessing unit, or passed by said preprocessing unit and following 
or preceding a marker being a not-encoded field of fixed length. 

10. The method according to claim 6, wherein said blocks of data 
are encoded using Huffman coding. 

11. A discrete cosine transform (DCT) apparatus, comprising: 
a transpose memory means; and 

an arithmetic circuit interconnected with said transpose memory 
means, said arithmetic circuit consisting of a combinatorial circuit for 
calculating a DCT without clocked storage means. 
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12. The DCT apparatus according to claim 11, wherein said 
combinatorial circuit comprises a predetermined number of 6tages for 
implementing said DCT, said stages arranged sequentially. 

13. The DCT apparatus according to claim 11, further comprising 
means for multiplexing input data provided to said apparatus and data 
output by said transpose memory means. 

14. The DCT apparatus according to claim 11, further comprising 
means for controlling operation of said DCT apparatus. 

15. An inverse discrete cosine transform (DCT) apparatus, 
comprising: 

a transpose memory means; and 

an arithmetic circuit interconnected with said transpose memory 
means, said arithmetic circuit consisting essentially of a combinatorial circuit 
for calculating an inverse DCT without clocked storage means. 

16. The inverse DCT apparatus according to claim 15, wherein 
said combinatorial circuit comprises a predetermined number of stages for 
implementing said inverse DCT, said stages being arranged sequentially. 

17. The inverse DCT apparatus according to claim 15, further 
comprising means for multiplexing input data provided to said apparatus and 
data output by said transpose memory means. 

18. The inverse DCT apparatus according to claim 1 5, further 
comprising means for controlling operation of said DCT apparatus. 
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19. A method of discrete cosine transforming (DCT) data, said 
method comprising the steps of: 

calculating a DCT of input data in accordance with a first orientation 
of said data using an arithmetic circuit consisting essentially of a 
combinatorial circuit for calculating said DCT without clocked storage means; 

storing said transformed input data in accordance with said first 
orientation in a transpose memory means interconnected with said 
combinatorial circuit; and 

calculating a DCT of said transformed input data stored in said 
transpose memory means in accordance with a second orientation of said data 
using said arithmetic circuit to provide transformed data. 

20. The method according to claim 19, wherein said DCT is 
calculated in a predetermined number of stages, said stages arranged 
sequentially. 

2 1. The method according to claim 19, further comprising the step 
of multiplexing input data provided to said apparatus and data output by 
said transpose memory means. 

22. A method of inverse discrete -cosine transforming (DCT) data, 
said method comprising the steps of: 

calculating an inverse DCT of input coefficients in accordance with a 
first orientation of said coefficients using an arithmetic circuit consisting 
essentially of a combinatorial circuit for calculating said inverse DCT without 
clocked storage means; 

storing said inverse transformed input coefficients in accordance with 
said first orientation in a transpose memory means interconnected with said 
combinatorial circuit; and 
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calculating an inverse DCT of said transformed input coefficients 
stored in said transpose memory means in accordance with a second 
orientation using said arithmetic circuit to provide output inverse 
transformed data. 

23. The method according to claim 22, wherein said inverse DCT is 
calculated in a predetermined number of stages, said stages arranged 
sequentially. 

24. The method according to claim 22, further comprising the step 
of multiplexing input data provided to said apparatus and coefficients output 
by said transpose memory means. 

25. A recording medium storing computer-executable program 
codes in which a method according to any one of claims 6 - 10 and 19 - 24 is 
implemented. 



\ 
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3. DETAILED DESCRIPTION OF THE INVENTION 
FIELD OF THE INVENTION 

The present invention relates to decoders for decoding codes of 
variable length that are interleaved with variable-length bit fields not being 
encoded, some of which are passed unchanged through the decoder. The 
length of the not-encoded bit fields are equal to or greater than zero. 

The present invention relates to a discrete cosine transform (DCT) 
apparatus utilizing a data path, which contains no pipelining or storage 
means, and is able to operate at high speeds. 

BACKGROUND OF THE INVENTION 

Generally, large amounts of data being compressed and decompressed 
for numerous reasons, including transmission, storage, retrieval, and 
processing use at some stage means of variable-length coding, such as 
Huffman coding. Huffman coding was originally disclosed by DA. Huffman 
in an article "A Method for the Construction of Minimum Redundancy Codes" 
Pmc. IRE, 40: 1098, 1952. In many cases, variable-length codes in an 
encoded bit stream are not contiguous, but are interleaved with other not- 
encoded bit fields. The bit fields may represent control and/or formatting 
information, and/or provide additional specification for encoded data including 
marker headers, marker codes, stuff bytes, padding bits and additional bits in, 
for example, JPEG encoded data. 

Variable length encoding allocates codes of different lengths to 
different input data according to the probability of occurrence of the input 
data, so that statistically more frequent input codes are allocated shorter 
codes than the less frequent codes. The less frequent input codes are 
allocated longer codes. The allocation of codes may be done either statically 
or adaptively. For the static case, the same output code is provided for a 
given input datum, no matter what block of data is being processed. For the 
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adaptive case, output codes arc assigned to input data based on a statistical 
analysis of a particular input block or set of blocks of data, and possibly 
changes from block to block (or from a set of blocks to a set of blocks). 

Significant problems arise when high speed decoding of variable 

5 length codes is required. Problems particularly arise when the encoded 
stream contains variable-length bit fields that are not encoded are 
interleaved with encoded data as in, for example, the JPEG standard. 
Greater difficulty in fast decoding of such variable-length encoded data occurs 
when the length of a particular not-encoded bit field can be determined only 

10 after the preceding (encoded) datum is fully decoded, as in JPEG standard. 
This generally excludes direct pipelining from being incorporated into a 
decoder, because the position of the beginning of the next encoded datum is 
known only after the preceding one is fully decoded. 

Existing solutions either require several steps (clock cycles) to decode 

15 a single input datum, which renders them too slow for many applications, or 
use iterative units to decode more than one symbol quasi-simultaneously, i.e. 
in one stage (clock cycle). However, additional decoding blocks often make 
such decoders economically unjustifiable, and further are not necessarily fast 
enough. This is because multiple decoding units do not work fully in parallel, 

20 since the beginning of processing in the next unit still depends on the result of 
processing in the previous one which determines the beginning of the input 
data for the next one. Thus, even if decoding of several symbols takes place 
in one stage (clock cycle), that. stage (clock period) is relatively long and the 
whole decoder is too slow for many applications. 

25 Therefore, a need clearly exists for a decoder that can decode variable- 

length codes interleaved with variable-length bit fields that are not encoded 
overcoming one or more of the disadvantages of conventional decoders. 

Typically, a discrete cosine transform (DCT) apparatus as shown in 
Fig. 77 performs a full two-dimensional (2-D) transformation of a block of 8x8 

30 pixels by first performing a 1-D DCT on the rows of the 8x8 pixel block. It 
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then performs another 1-D DCT on the columns of the 8x8 pixel block. Such 
an apparatus typically consists of an input circuit 1096, an arithmetic circuit 
1104, a control circuit 1098, a transpose memory circuit 1090, and an output 
circuit 1092. 

The input circuit 1096 accepts 8-bit pixels from the 8x8 block. The 
input circuit 1096 is coupled by intermediate multiplexers 1100, 1102 to the 
arithmetic circuit 1004. The arithmetic circuit 1104 performs mathematical 
operations on either a oomplete row or column of the 8x8 block. The control 
circuit 1098 controls aU the other circuits, and thus implements the DCT 
algorithm. The output of the arithmetic circuit is coupled to the transpose 
memory 1090, register 1095 and output circuit 1092. The transpose memory 
is in turn connected to multiplexer 1100, which provides output to the next 
multiplexer 1102. The multiplexer 1102 also receives input from the register 
1094. The transpose circuit 1090 accepts 8x8 block data in rows and 
produces that data in columns. The output circuit 1092 provides the 
coefficients of the DCT performed on a 8x8 block of pixel data. 

In a typical DCT apparatus, it is the speed of the arithmetic circuit 
1104 that basically determines the overall speed of the apparatus, since the 
arithmetic circuit 1104 is the most complex. 

The arithmetic circuit 1104 of Fig. 77 ib typically implemented by 
breaking the arithmetic process down into several stages as described 
hereinafter with reference to Fig. 78. A single circuit is then built that 
implements each of these stages 1114, 1148, 1152, 1156 using a pool of 
common resources, such as adders and multipliers. Such a circuit 1104 is 
mainly disadvantageous due to it being slower than optimal, because a single, 
common circuit is used to implement the various stages of circuit 1104. This 
includes a storage means used to store intermediate results. Since the time 
allocated for the clock cycle of such a circuit must be greater or equal to the 
time of the slowest stage of the circuit, the overall time is potentially longer 
than the sum of all the stages. 
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Fig. 78 depicts a typical arithmetic data path, in accordance with the 
apparatus of Fig. 77, as part of a DCT with four stages. The drawing does 
not reflect the actual implementation, but instead reflects the functionality. 
Each of the four stages 1144, 1148, 1152, and 1156 is implemented using a 
single, reconfigurable circuit. It is reconfigured on a cycle -by- cycle basis to 
implement each of the four arithmetic stages 1144, 1148, 1152, and 1156 of 
the 1-D DCT. In this circuit, each of the four stages 1144, 1148, 1152, and 
1156 uses pool of common resources (e.g. adders and multipliers) and thus 
minimises hardware. 

However, the disadvantage of this circuit is that it is slower than 
optimal. The four stages 1144, 1148, 1152, and 1156 are each 
implemented from tho same pool of adders and multipliers. The period of 
the clock is determined by the speed of the slowest stage, which in this 
example is 20 ns (for block 1144). Adding in the delay (2ns each) of the input 
and output multiplexers 1146 and 1154 and the delay (3ns) of the flip-flop 
1150, the total time is 27 ns. Thus, the fastest this DCT implementation 
can run at is 27 ns. 

Pipelined DCT implementations are also well known. The drawback 
with such implementations is that they require large amounts of hardware to 
implement. Whilst the present invention does not offer the same 
performance in terms of throughput, it offers an extremely good 
performance/size compromise, and good speed advantages over most of the 
current DCT implementations. 

Therefore, a need clearly exists for an improved DCT/ inver9e-DCT 
method and apparatus that is able to overcome one or more disadvantages of 
conventional techniques. In particular, a need clearly exists for a method and 
apparatus that is able to reduce the time taken for the main arithmetic 
circuit in a DCT/inverse-DCT apparatus to calculate required results, thereby 
improving the overall performance of the DCT or inverse DCT. 
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SUMMARY OF THE INVENTION 

In accordance with a first aspect of the invention, there is provided an 
apparatus for decoding blocks of data encoded using a plurality of variable- 
length code words, the blocks of data also comprising not-encoded fields of 
fixed length, the code words being interleaved with not-encoded bit fields of 
variable length, the apparatus comprising: 

a preprocessing logic unit for removing a plurality of not-encoded fields 
of fixed length and outputting the plurality of variable-length code words 
interleaved with the not-encoded bit fields of variable length, and outputting 
signals indicating positions of the plurality of not-encoded fields of fixed 
length in the blocks of data; 

means for passing the signals indicating positions to an output of the 
apparatus in a way synchronous with the data being decoded, such that if 
there is a multiplicity of variable-length coded data between the not-encoded 
fields of fixed length on the input of the apparatus, a multiplicity of 
corresponding decoded data appears on the output of the apparatus between 
any two signals indicating positions corresponding to the not-encoded words of 
fixed length on the input. 

Preferably, the apparatus further comprises: a first processing unit 
comprising a first set of barrel shifters and a first register, wherein the first 
processing unit processes the outputted plurality of variable-length code 
words interleaved with the not-encoded bit fields of variable length; and a 
second processing unit comprising a second set of barrel shifters and a second 
register, the second processing unit for processing the outputted signals 
indicating positions of the not-encoded words of fixed length in the blocks of 
data; wherein the first and second processing units are identical, and the 
output of the respective barrel shifters and the units receive identical control 
signals. Further, the output of the second processing unit for processing 
signals indicating positions of the not-encoded words of fixed length may be 
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used to determine the size of a not -encoded variable length field to be removed 
from the data stored in a data register for decoding purposes. 

Preferably, the preprocessing logic unit for removing the not-encoded 
fields of fixed length outputs a plurality of variable-length code words 
interleaved with the not-encoded bit fields of variable length as words of fixed 
length composed of bit fields of fixed length such that a single bit field has a 
corresponding tag indicating if the outputted field is passed or removed by the 
preprocessing unit, or passed by the preprocessing unit and following or 
preceding a marker being a not-encoded field of fixed length. 

Preferably, the blocks of data are encoded using Huffman coding. 

In accordance with a second aspect of the invention, theie is provided a 
method of decoding blocks of data which have been encoded using a plurality 
of variable-length code words, the blocks of data comprising not-encoded fields 
of fixed length, the code words being interleaved with not-encoded bit fields of 
variable length, the method comprising the steps: 

removing the not-encoded fields of fixed length, and outputting the 
plurality of variable-length code words interleaved with the not-encoded bit 
fields of variable length, and outputting signals indicating positions of the 
not-encoded words of fixed length in the blocks of data; 

passing the signals indicating positions to an output in a way 
synchronous with the data being decoded, such that if there is a multiplicity of 
variable -length ooded data between the not-encoded fields of fixed length 
being input, a multiplicity, of corresponding decoded data appears at the 
output between any two signals indicating positions corresponding to the not- 
encoded words of fixed length at the input. 

Preferably, the method further comprises the steps of: processing 
the outputted plurality of variable-length code words interleaved with the not- 
encoded bit fields of variable length using a first processing unit comprising a 
first set of barrel shifters and a first register; and processing the outputted 
signals indicating positions of the not-encoded words of fixed length in the 
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blocks of data using a second processing unit comprising a second set of barrel 
shifters and a second register; wherein the first and second processing units 
are identical, and the output of the respective barrel shifters and the units 
receive identical control signals. Further, the method may comprise the step 
of determining the size of a not-encoded variable length field to be removed 
from the data stored in a data register for decoding purposes dependent upon 
the output of the second processing unit for processing signals indicating 
positions of the not-encoded words of fixed length. 

Preferably, the method may further comprise the step of outputting, 
by the preprocessing logic unit for removing the not-encoded fields of fixed 
length, a plurality of variable -length code words interleaved with the not- 
encoded bit fields of variable length as words of fixed length composed of bit 
fields of fixed length such that a single bit field has a corresponding tag 
indicating if the outputted field is passed or removed by the preprocessing 
unit, or passed by the preprocessing unit and following or preceding a marker 
being a not-encoded field of fixed length. 

Preferably, the blocks of data are encoded u9ing Huffman coding. 

In the following detailed description, the reader's attention is directed, 
in particular, to Pigs, 82 to 91 and their associated description without 
intending to detract from the disclosure of the remainder of the description. 

In accordance with a third aspect of the invention, there is provided a 
discrete cosine transform (DCT) apparatus, comprising; a transpose memory 
means; and an arithmetic circuit interconnected with the transpose memory 
means, the arithmetic circuit consisting of a combinatorial circuit for 
calculating a DCT without clocked storage means. 

Preferably, the combinatorial circuit comprises a predetermined 
number of stages for implementing the DCT, the stages arranged 
sequentially. 
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Preferably, the DCT apparatus further comprises means for 
multiplexing input data provided to the apparatus and data output by the 
transpose memory means. It may also comprise means for controlling 
operation of the DCT apparatus. 

In accordance with a fourth aspect of the invention, there is provided 
an inverse discrete cosine transform (DCT) apparatus, comprising: a 
transpose memory means; and an arithmetic circuit interconnected with the 
transpose memory means, the arithmetic circuit consisting essentially of a 
combinatorial circuit for calculating an inverse DCT without clocked storage 
means. 

In accordance with a third aspect of the invention, there is provided a 
method of discrete cosine transforming (DCT) data, the method comprising 
the steps of: 

calculating a DCT of input data in accordance with a first orientation 
of the data using an arithmetic circuit consisting essentially of a 
combinatorial circuit for calculating the DCT without clocked storage means; 

storing the transformed input data in accordance with^ the first 
orientation in a transpose memory means interconnected with the 
combinatorial circuit; and 

calculating a DCT of the transformed input data stored in the 
transpose memory means in accordance with a second orientation of the data 
using the arithmetic circuit to provide transformed data. 

Preferably, the DCT is calculated in a predetermined number of 
stages, the stages arranged sequentially. 

The method may also comprise the step of multiplexing input data 
provided to the apparatus and data output by the transpose memory means. 

In accordance with a fourth aspect of the invention, there is provided a 
method of inverse discrete-cosine transforming (DCT) data, the method 
comprising the steps of: 
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calculating an inverse DCT of input coefficients in accordance with a 
first orientation of the coefficients using an arithmetic circuit consisting 
essentially of a combinatorial circuit for calculating the inverse DCT without 
clocked storage means; 

storing the inverse transformed input coefficients in accordance with 
the first orientation in a transpose memory means interconnected with the 
combinatorial circuit; and 

calculating an inverse DCT of the transformed input coefficients 
stored in the transpose memory means in accordance with a second 
orientation using the arithmetic circuit to provide output inverse transformed 
data. 

In the following detailed description, the reader's attention is directed, 
inparticular, to Figs. 79, 80 and 81 and their associated description without 
intending to detract from the disclosure of the remainder of the description. 
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3,0 Description of the Preferred and Other Embodiments 

In the preferred embodiment, a substantial advantage is gained in 
hardware rasterization by means of utilization of two independent instruction 
streams by a hardware accelerator. Hence, while the first instruction stream 
can be preparing a current page for printing, a subsequent instruction stream 
can be preparing the next page for printing. A high utilization of hardware 
resources is available especially where the hardware accelerator is able to 
work at a speed substantially faster than the speed of the output device. 

The preferred embodiment describes an arrangement utilising two 
instruction streams. However, arrangements having further instruction 
streams can be provided where the hardware trade-offs dictate that 
substantial advantages can be obtained through the utilization of further 
streams. 

The utilization of two streams allows the hardware resources of the 
raster image co-processor to be kept fully engaged in preparing subsequent 
pages or bands, strips, eLc., depending on the output printing device while a 
present page, band, etc is being forwarded to a print device. 
3.1 General Arrangement of Plural Stream Architecture 

In Fig. 1 there is schematically illustrated a computer hardware 
arrangement 201 which constitutes the preferred embodiment. The 
arrangement 201 includes a standard host computer system which takes the 
form of a host CPU 202 interconnected to its own memory store (RAM) 203 
via a bridge 204. The host computer system provides all the normal 
facilities of a computer system including operating systems programs, 
applications, display of information, etc. The host computer system is 
connected to a standard PCI bus 206 via a PCI bus interface 207. The PCI 
standard is a well known industry standard and most computer systems sold 
today, particularly those running Microsoft Windows (trade mark) operating 
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systems, normally come equipped with a PCI bus 206. The PCI bus 206 
allows the arrangement 201 to be expanded by means of the addition of one or 
more PCI cards, eg. 209, each of which contain a further PCI bus interface 210 
and other devices 211 and local memory 212 for utilization in the 
arrangement 201. 

In the preferred embodiment, there is provided a raster image 
accelerator card 220 to assist in the speeding up of graphical operations 
expressed in a page description language. The raster image accelerator card 
220 (also having a PCI bus interface 221) is designed to operate in a loosely 
coupled, shared memory manner with the host CPU 202 in the same manner 
as other PCI cards 209. It is possible to add further image accelerator cards 
220 to the host computer system as required. The raster image accelerator 
card is designed to accelerate those operations that form the bulk of the 
execution complexity in raster image processing operations. These can 
include: 



(a) 


Composition 


(b) 


Generalized Color Space Conversion 


(c) 


JPEG compression and decompression 


(d) 


Huffman, run length and predictive coding and decoding 


(e) 


Hierarchial image (Trade Mark) decompression 


(0 


Generalized affine image transformations 


(g) 


Small kernel convolutions 


(h) 


Matrix multiplication 


(i) 


Halftoning 


0) 


Bulk arithmetic and memory copy operations 



.The rast&r imago accolerator card 220 further includes its own local 
memory 223 connected to a raster image co-processor 224 which operates the 
raster image accelerator card 220 generally under instruction from the host 
CPU 202. The co-processor 224 is preferably constructed as an Application 
Specific Integrated Circuit (ASIC) chip. The raster image co-processor 224 
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includes the ability to control at least one printer device 226 as required via a 
peripheral interface 225. The image accelerator card 220 may also control 
any input/output device, including scanners. Additionally, there is provided 
on the accelerator card 220 a generic external interface 227 connected with the 
raster image co-processor 224 for its monitoring and testing. 

In operation, the host CPU 202 sends, via PCI bus 206, a series of 
instructions and data for the creatioil of images by the raster image co- 
processor 224. The data can be stored in the local memory 223 in addition to 
a cache 230 in the raster image co-processor 224 or in registers 229 also 
located in the co-processor 224. 

Turning now to Fig. 2, there is illustrated, in more detail, the raster 
image co-processor 224. The co-processor 224 is responsible for the 
acceleration of the aforementioned operations and consists of a number of 
components generally under the control of an instruction controller 235. 
Turning first to the co-processor's communication with the outside world, 
there is provided a local memory controller 236 for communications with the 
local memoiy 223 of Fig. 1. A peripheral interface controller 237 is also 
provided for the communication with printer devices utilising standard 
formats such as the Centronics interface standard format or other video 
interface formats. The peripheral interface controller 237 is interconnected 
with the local memory controller 236. Both the local memory controller 236 
and the external interface controller 238 are connected with an input interface 
switch 252 which is in turn connected to the instruction controller 235. The 
input interface switch 252 is also connected to a pixel organizer 246 and a 
data cache controller 240. The input interface switch 252 is provided for 
switching data from the external interface controller 238 and local memoiy 
controller 236 to the instruction controller 235, the data cache controller 240 
and the pixel organizer 246 as required. 

For communications with the PCI bus 206 of Fig. 1 the external 
interface controller 238 is provided in the raster image co-processor 224 and is 
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connected to the instruction controller 235. There is also provided a 
miscellaneous module 239 which is also connected to the instruction controller 
235 and which deals with interactions with the co-processor 224 for purposes 
of test diagnostics and the provision of clocking and global signals. 

The data cache 230 operates under the control of the data cache 
controller 240 with which it is interconnected. The data cache 230 is utilized 
in various ways, primarily to store recently used values that are likely to be 
subsequently utilized by the co-processor 224. The aforementioned 
acceleration operations are carried out on plural streams of data primarily by 
a JPEG coder/decoder 241 and a main data path unit 242. The units 241, 
242 are connected in parallel arrangement to all of the pixel organizer 246 
and two operand organizers 247, 248. The processed streams from units 241, 

242 are forwarded to a results organizer 249 for processing and reformatting 
where required. Often, it is desirable to store intermediate results close at 
hand. To this end, in addition to the data cache 230, a multi-used value 
buffer 250 is provided, interconnected between the pixel organizer 246 and the 
result organizer 249, for the storage of intermediate data. The result 
organizer 249 outputs to the external interface controller 238, the local 
memory controller 236 and the peripheral interface controller 237 as required. 

As indicated by broken lines in Fig. 2, a further (third) data path unit 

243 can, if required be connected "in parallel' 1 with the two other data paths in 
the form of JPEG coder/decoder 241 and the main data path unit 242. The 
extension to 4 or more data paths is achieved in the same way. Although the 
paths are "parallel" connected, they do not operate in parallel. Instead only 
one path at a time operates. 

The overall ASIC design of Fig. 2 has been developed in the following 
manner. Firstly, in printing pages it is necessary that there not be even 
small or transient artefacts. This is because whilst in video signal creation 
for example, such small errors if present may not be apparent to the human 
eye (and hence be unobservable), in printing any small artefact appears 
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permanently on the printed page and can sometimes be glaringly obvious. 
Further, any delay in the signal reaching the printer can be equally disastrous 
resulting in white, unprinted areas on a page as the page continues to move 
through the printer. It is therefore necessary to provide results of very high 
quality, very quietly and this is best achieved by a hardware rather than a 
software solution. 

Secondly, if one lists all the various operational steps (algorithms) 
required to be carried out for the printing process and provides an equivalent 
item of hardware for each step, the total amount of hardware becomes 
enormous and prohibitively expensive. Also the speed at which the hardware 
can operate is substantially limited by the rate at which the data necessary 
for, and produced by, the calculations can be fetched and despatched 
respectively. That is, there is a speed limitation produced by the limited 
bandwidth of the interfaces. 

However, overall ASIC design is based upon a surprising realization 
that if the enormous amount of hardware is represented schematically then 
various parts of the total hardware required can be identified as being (a) 
duplicated and (b) not operating all the time. This is particularly the case in 
respect of the overhead involved in presenting the data prior to its calculation. 

Therefore various steps were taken to reach the desired state of 
reducing the amount of hardware whilsL keeping all parts of the hardware as 
active as possible. The first step was the realization that in image 
manipulation often repetitive calculations of the same basic type were 
required to be carried out. Thus if the data were streamed in some way, a 
calculating unit could be configured to carry out a specific type of calculation, a 
long stream of data processed and then the calculating unit could be 
reconfigured for the next type of calculation step required. If the data 
streams were reasonably long, then the time required for reconfiguration 
would be negligible compared to the total calculation time and thus 
throughput would be enhanced. 
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la addition, the provision of plural data processing paths means that 
in the event that one path is being reconfigured whilst the other path is being 
used, then there is substantially no loss of calculating time due to the 
necessary reconfiguration. This applies where the main data path unit 242 
carries out a more general calculation and the other data path(s) carry out 
more specialized calculation such as JPEC coding and decoding as in unit 241 
or, if additional unit 243 is provided, it can provide entropy and/or Huffman 
coding/decoding. 

Further, whilst the calculations were proceeding, the fetching and 
presenting of data to the calculating unit can be prooeeding. This process can 
be further speeded up, and hardware resources better utilized, if the various 
types of data are standardized or normalized in some way. Thus the total 
overhead involved in fetching and despatching data can be reduced. 

Importantly, as noted previously, the co-processor 224 operates under 
the control of host CPU 202 (Fig. 1). In this respect, the instruction controller 
235 is responsible for the overall control of the co-processor 224. The 
instruction controller 235 operates the co-processor 224 by means of utilising 
a control bus 231, hereinafter known as the CBus. The CBus 231 is 
connected to each of the modules 236-250 inclusive to set registers (231 of Fig. 
1) within each module so as to achieve overall operation of the co-processor 
224. In order not to overly complicate Fig, 2, the interconnection of the 
control bus 231 to each of the modules 236-250 is omitted from Fig. 2. 

Turning now to Fig. 3, there is illustrated a schematic layout 260 of 
the available module registers. The layout 260 includes registers 261 
dedicated to the overall control of the co-processor 224 and its instruction 
controller 235. The co-processor modules 236-250 include similar registers 
262. 

3,2 Host/Co-processor Queuing 

With the above architecture in mind, it is clear that there is a need to 
adequately provide for cooperation between the host processor 202 and the 
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image co-processor 224. However, the solution to this problem is general and 
not restricted to the specific above described architecture and therefore will be 
described hereafter with reference to a more general computing hardware 
environment. 

Modern computer systems typically require some method of memory 
management to provide for dynamic memory allocation. In the case of a 
system with one or more co-processors, some method is necessary to 
synchronize between the dynamic allocation of memory and the use of that 
memory by a co-processor. 

Typically a computer hardware configuration has both a CPU and a 
specialized co-processor, each sharing a bank of memory. La such a system, 
the CPU is the only entity in the system capable of allocating memory 
dynamically. Once allocated by the CPU for use by the co-processor, this 
memory can be used freely by the co-prooessor until it is no longer required, at 
which point it is available to be freed by the CPU. This implies that some 
form of synchronization is necessary between the CPU and the co-processor in 
order to ensure that the memory is released only after the co-processor is 
finished using it. There are several possible solutions to this problem but 
each has undesirable performance implications. 

The use of statically allocated memory avoids the need for 
synchronization, but prevents the system from adjusting its memory resource 
usage dynamically. Similarly, having the CPU block and wait until the co- 
prooessor has finished performing each operation is possible, but this 
substantially reduces parallelism and hence reduces overall system 
performance. The use of interrupts to indicate completion of operations by 
the co-processor is also possible but imposes significant processing overhead 
if co-processor throughput is very high. 

In addition to the need for high performance, such a system also has to 
deal with dynamic memory shortages gracefully Most computer systems 
allow a wide range of memory size configurations. It is important that those 
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systems with large amounts of memory available make full vise of their 
available resources to maximize performance. Similarly those systems with 
minimal memory size configurations should still perform adequately to be 
useable and, at the very least, should degrade gracefully in the face of a 
memory shortage. 

To overcome these problems, a synchronization mechanism is 
necessary which will maximize system performance while also allowing co- 
processor memory usage to adjust dynamically to both the capacity of the 
system and the complexity of the operation being performed. 

In general, the preferred arrangement for synchronising the (host) 
CPU and the co-processor is illustrated in Fig, 4 where the reference numerals 
used are those already utilized in the previous description of Fig. 1. 

Thus in Fig. 108, the CPU 202 is responsible for all memory 
management in the system. It allocates memory 203 both for its own uses, 
and for use by the co-processor 224. The co-processor 224 has its own 
graphics-specific instruction set, and is capable of executing instructions 1022 
from the memory 203 which is shared with the host processor 202. Each of 
these instructions can also write results 1024 back to the shared memory 203, 
and can read operands 1023 from the memory 203 as well. The amount of 
memory 203 required to store operands 1023 and results 1024 of co-processor 
instructions varies according to the complexity and type of the particular 
operation. 

The CPU 202 is also responsible for generating the instructions 1022 
executed by the co-processor 224. To maximize the degree of parallelism 
between the CPU 202 and the co-processor 224, instructions generated by the 
CPU 202 are queued as indicated at 1022 for execution by the oo-processor 
224. Each instruction in the queue 1022 can reference operands 1023 and 
results 1024 in the shared memory 203, which has been allocated by the host 
CPU 202 for use by the co-processor 224. 
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The method utilizes an interconnected instruction generator 1030, 
memory manager 1031 and queue manager 1032, as shown in Fig. 5. All 
these modules execute in a single process on the host CPU 202. 

Instructions for execution by the co-processor 224 are generated by the 
instruction generator 1030, which uses the services of the memoiy manager 
1031 to allocate space for the operands 1023 and results 1024 of the 
instructions being generated. The instruction generator 1030 also uses the 
services of the queue manager 1032 to queue the instructions for execution by 
the co-processor 224. 

Once each instruction has been executed by the co-processor 224, the 
CPU 202 can free the memory which was allocated by the memory manager 
1031 for use by the operands of that instruction. The result of one 
instruction can also become an operand for a subsequent instruction, after 
which its memory can also be freed by the CPU. Rather than fielding an 
interrupt, and freeing such memory as soon as the co-processor 224 has 
finished with it, the system frees the resources needed by each instruction via 
a cleanup function which runs at some stage after the co-processor 224 has 
completed the instruction. The exact time at which these cleanups occur 
depends on the interaction between the memory manager 1031 and the queue 
manager 1032, and allows the system to adapt dynamically according to the 
amount of system memory available and the amount of memoiy required by 
each co-processor instruction, 

Fig. 6 schematically illustrates the implementation of the co- 
processor instruction queue 1022. Instructions are inserted into a pending 
instruction queue 1040 by the host CPU 202, and are read by the co-processor 
224 for execution. After execution by the co-processor 224, the instructions 
remain on a cleanup queue 1041, so that the CPU 202 can release the 
resources that the instructions required after the co-processor 224 has 
finished executing them. 
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The instruction queue 1022 itself can be implemented as a fixed or 
dynamically sized circular buffer. The instruction queue 1022 decouples the 
generation of instructions by the CPU 202 from their execution by the co- 
processor 224. 

Operand and result memory for each instruction is allocated by the 
memory manager 1031 (Fig. 5) in response to requests from the instruction 
generator 1030 during instruction generation. It is the allocation of this 
memory for newly generated instructions which triggers the interaction 
between the memory manager 1031 and the queue manager 1032 described 
below, and allows the system to adapt automatically to the amount of 
memory available and the complexity of the instructions involved. 

The instruction queue manager 1032 is capable of waiting for the co- 
processor 224 to complete the execution of any given instruction which has 
been generated by the instruction generator 1030. However, by providing a 
sufficiently large instruction queue 1022 and sufficient memory 203 for 
allocation by the memory manager 1031, it becomes possible to avoid having 
to wait for the co-processor 224 at all, or at least until the very end of the 
entire instruction sequence, which can be several minutes on a very large job. 
However, peak memory usage can easily exceed the memory available, and at 
this point the interaction between the queue manager 1032 and the memory 
manager 1031 comes into play. 

The instruction queue manager 1032 can be instructed at any time to 
"cleanup" the completed instructions by releasing the memory that was 
dynamically allocated for them. If the memory manager 1031 detects that 
available memory is either running low oris exhausted, its first recourse is to 
instruct the queue manager 1032 to perform such a cleanup in an attempt to 
release some memory which is no longer in use by the co-processor 224. This 
can allow the memory manager 1031 to satisfy a request from the instruction 
generator 1030 for memory required by a newly generated instruction, without 
the CPU 202 needing to wait for, or synchronize with, the co-processor 224. 



-303- 



30 



If such a request made by the memory manager 1031 for the queue 
manager 1032 to cleanup completed instructions does not release adequate 
memory to satisfy the instruction generator's new request, the memory 
manager 1031 can request that the queue manager 1032 wait for a fraction, 
say half, of the outstanding instructions on the pending instruction queue 
1040 to complete. This will cause the CPU 202 processing to block until 
some of the co-processor 224 instructions have been completed, at which point 
their operands can be freed, which can release sufficient memory to satisfy the 
request. Waiting for only a fraction of the outstanding instructions ensures 
that the co-processor 224 is kept busy by maintaining at least some 
instructions in its pending instruction queue 1040. In many cases the 
cleanup from the fraction of the pending instruction queue 1040 that the CPU 
202 waits for, releases sufficient memory for the memory manager 1031 to 
satisfy the request from the instruction generator 1030. 

In the unlikely event that waiting for the co-processor 224 to complete 
execution of, say, half of the pending instructions does not release sufficient 
memory to satisfy the request, then the final recourse of the memory manager 
1031 is to wait until all pending co-processor instructions have completed. 
This should release sufficient resources to satisfy the request of the 
instruction generator 1030, except in the case of extremely large and complex 
jobs which exceed the system's present memory capacity altogether. 

By the above described interaction between the memory manager 
1031 and the queue manager 1032, the system effectively tunes itself to 
maximize throughput for the given amount of memory 203 available to the 
system. More memory results in less need for synchronization and hence 
greater throughput. Less memory requires the CPU 202 to wait more often 
for the co-processor 224 to finish using the scarce memory 203, thereby 
yielding a system which still functions with minimal memory available, but 
at a lower performance. 
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The steps taken by the memory manager 1031 when attempting to 
satisfy a request from the instruction generator 1030 are summarized below. 
Each step is tried in sequence, after which the memory manager 1031 checks 
to see if sufficient memory 203 has been made available to satisfy the request. 
If so, it stops because the request can be satisfied; otherwize it proceeds to the 
next step in a more aggressive attempt to satisfy the request: 

1. Attempt to satisfy the request with the memory 203 already 
available. 

2. Cleanup all completed instructions. 

3. Wait for a fraction of the pending instructions. 

4. Wait for all the remaining pending instructions. 

Other options can also be used in the attempt to satisfy the request, 
such as waiting for different fractions (such as one-third or two-thirds) of the 
ponding instructions, or waiting for specific instructions which are known to be 
using large amounts of memory. 

Turning now to Fig. 7, in addition to the interaction between the 
memory manager 1031 and the queue manager 1032, the queue manager 
1032 can also initiate a synchronization with the co-processor 224 in the case 
where space in a fixed-length instruction queue buffer 1050 is exhausted. 
Such a situation is depicted in Fig. 7. In Fig. 7 the pending instructions 
queue 1040 is ten instructions in length. The latest instruction to be added 
to the queue 1040 has the highest occupied number. Thus where space is 
exhausted the latest instruction is located at position 9. The next 
instruction to be input to the co-processor 224 is waiting at position zero. 

In such a case of exhausted space, the queue manager 1032 will also 
wait for, say, half the pending instructions to be completed by the co-processor 
224. This delay normally allows sufficient space in the instruction queue 
1040 to be freed for new instructions to be inserted by the queue manager 
1032. 
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The method ueed by the queue manager 1032 when scheduling new 
instructions is as follows: 

1. Test to see if sufficient space is available in the instruction 
queue 1040. 

2 If sufficient space is not available, wait for the co-processor to 
complete some predetermined number or fraction of instructions. 
3. Add the new instructions to the queue. 

The method used by the queue manager 1032 when asked to wait for a 
given instruction is as follows: 

1. Wait until the co-processor 224 indicates that the instruction is 
complete. 

2. While there are instructions completed which are not yet cleaned 
up, clean up the next completed instruction in the queue. 

The method used by the instruction generator 1030 when issuing new 
instructions is as follows: 

1. Request sufficient memory for the instruction operands 1023 
from the memory manger 1031. 

2. Generate the instructions to be submitted. 

3. Submit the co-processor instructions to the queue manager 1032 
for execution. 

The following is an example of pseudo code of the above decision 
making processes. 



MEMORY MANAGER 

ALLOCATE_MEMORY 
BEGIN 

IF sufficient memory is NOT available to satisfy request 
THEN 

Clean up all completed instructions. 
ENDIF 
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IF sufficient memory is still NOT available to satisfy 

request 

THEN 

CALL WATT_FOR_1NSTRUCTION for half the 

pending 

instructions. 
ENDIF 

IF sufficient memory is still NOT available to satisfy 

request 

THEN 

RETURN with an error. 
ENDIF 

RETURN the allocated memory 

END 

QUEUE MANAGER 

SCHEDULE_INSTRUCTION 
BEGIN 

IF sufficient space is NOT available in the instruction queue 
THEN 

WAFT for the co-processor to complete some 

predetermined 

number of instructions. 
ENDIF 

Add the new instructions to the queue. 

END 

WAIT_FOR_INSTR UCTION0) 
BEGIN 

WATT until the co-processor indicates that instruction i is 

complete. 
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WHILE there are instructions completed which are not yet 

cleaned 

up 
DO 

IF the next completed instruction has a cleanup 

function 

THEN 

CALL the cleanup function 
ENDIF 

REMOVE the completed instruction from the queue 
DONE 

END 

INSTRUCTION GENERATOR 

GENERATE_INSTR UCTIONS 
BEGIN 

CALL ALLOCATE_MEMORY to allocate sufficient memory 

for 

the instructions operands from the memory manager. 
GENERATE the instructions to be submitted. 
CALL SCHEDULE^INSTRUCTION submit the co- 
processor 

instructions to the queue manager for execution. 

END 

3.3 Register Description of Co-processor 

As explained above in relation to Figs. 1 and 3, the co-processor 224 
maintains various registers 261 for the execution of each instruction stream. 

Referring to each of the modules of Pig, 2, Tabic 1 sets out the name, 
type and description of each of the registers utilized by the co-processor 224 
while Appendix B sets out the structure of each field of each register. 
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Table 1: Register Description 



NAME 


TYPE 


DESCRIPTION 


External Interface Controller Registers 


eic_cfg 


Config2 


Configuration 


eic_stat 


Status 


Status 


eic err int 


Interrupt 


Error and Interrupt Status 


eic_err_irit_en 


Config2 


Error and Interrupt Enable 


eic_test 


Config2 


Test modes 


eic__gen_pob 


Config2 


Generic bus programmable output bits 


eic Jiigh_addr 


Configl 


Dual address cycle offset 


eic_wtlb_v 


Control2 


Virtual address and operation bits for 
TLB Invalidate/Write 


eic_wtlb_p 


Config2 


Physical address and control bits for 
TLB Write 


eic_mrnu_v 


Status 


Most recent MMU virtual address 
translated, and current LRU location. 


eic_mrnu_v 


Status 


Most recent page table physical 
address fetched by MMU, 


eic_ip_addr 


Status 


Physical address for most recent IBus 
access to the PCI Bus, 


eic_rp_addr 


Status 


Physical address for most recent RBus 
access to the PCI Bus. 


eic_ig_addr 


Status 


Address for most recent IBus access to 
the Generic Bus. 


eic_rg_data 


Status 


Address for most recent RBus^access to 1 
the Ge neric Bus . _J 
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NAME | TYPE 


DESCRIPTION 


Local Memory Controller Registers 


ImLcfg 


Control2 


General configuration register 


lmi_sts 


Status 


General status register 


lmi_err_int 


Interrupt 


Error and interrupt status register 


lmi_err_int_en 


Control2 


Error and interrupt enable register 


lmi_dcfg 


Control2 


DRAM configuration register 


lmi_mode 


Control2 


SDRAM mode register 


Peripheral Interface Controller Registers 


picjefg 


Config2 


Configuration 


picjstat 


Status 


Status 


pic_err_jnt 


Interrupt 


Interrupt/Error Status 


pic_err_in t__e n 


Config2 


Interrupt/Error Enable 


pic_abus__cfjg 


Control2 


Configuration and control for ABus 


pic_abus_addr 


Configl 


Start address for ABus transfer 


pic_cent_cfg 


Control2 


Configuration and control for 
Centronics 


pic_cent_dir 


Config2 


Centronics pin direct control register 


pic_reverse_cfg 


Control2 


Configuration and control for reverse 
(input) data transfers 


picJimerO 


Configl 


Initial data timer value 


pic_timerl 


Configl 


Subsequent data timer value 


Miscellaneous Module Registers 


mm_cfg 


Config2 


Configuration Register 


nim_stat 


Status 


Status Register 


mm_errjnt 


Interrupt 


Error and Interrupt Register 


mm_err_int_en 


Config2 


Error and Interrupt Masks 


mm_gefg 


Config2 


Global Configuration Register 


mm_diag 


Config 


Diagnostic Configuration Register 


mm_grst 


Config 


Global Reset Register 


mra_gerr 


Config2 


Global Error Register 


mm_gexp 


Config2 


Global Exception Register 


mm_gint 


Config2 


Global Interrupt Register 


rnm_active 


Status 


Global Active signals 
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| NAME 


TYPE i 


DESCRIPTION 


Instruction Controller Registers 


If* fM CT 

1C - CI ? 


Onnfip-2 


V>Ull.LLg ULublUU J.VCglo Lft/J. 


X\j O L'Ci U 


Status/ 
Interrupt 




ir prr int 

IVj \rf 1. L X 111/ 


Interrupt 


Pjiror and Intprnrot Rp$Hstpr fwrite to 
clear error and interrupt) 


ic_err_int_en 


Config2 


Error and Interrupt Enable Register 


ic_ipa 


Control 1 


A stream Instruction Pointer 


ic_tda 


Configl ! 


A stream Todo Register 


ic fna 


Control 1 


A stream Finished Register 


ip in tfl 

IV/ J»IXt/CJL 


Confiel 


A stream Interrupt Register 


1U 11/ CL 


Stflt.llS 

IJlUUUO 


A st.TPflin Tjfist". Ovprlannpd TnstnirHon 
Sequence number \ 


ic_ipb 


Control 1 


B stream Instruction Pointer 


ic_tdb 


Configl 


B stream Todo Register 


ic_fnb 


Control 1 


B stream Finished Register 


ic_intb 


Configl 


B stream Interrupt Register 


ic_lob 


Status 


B stream Last Overlapped Instruction 
Sequence number 


ic_sema 


Status 


A stream Semaphore 


ic semb 


Status 


B stream Semaphore 


Data Cache ( 


Controller Registers 


dcc_cfgl 


Config2 


DCC configuration 1 register 


dcc_stat 


Status 


state machine status bits 


dcc_err _jnt 


Status 


DCC error status register 


dcc_err_int_en 


Controll 


DCC error interrupt enable bits 


dcc_cfg2 


Control2 


DCC configuration 2 register 


dcc_addr 


Configl 


Base address register for special 
address modes. 


dec lvO 


Controll 


"valid" bit status for lines 0 to 3 1 


dec lvl 


Controll 


"valid" bit status for lines 32 to 63 


dcc_lv2 


Controll 


"valid" bit status for lines 64 to 95 


dec lv3 


Controll 


Valid" bit status for lines 96 to 127 


dcc_raddrb 


Status 


Operand Organizer B request address 


dcc_raddrc 


Status 


Operand Organizer C request address 
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NAME 


TYPE 


DESCRIPTION 


dcc_test 


Control 1 


DCC test register 


Pixel Organizer Registers 


po cfe 


Config2 


Configuration Register 


po_stat 


Status 


Status Register 


po_err_int 


Interrupt 


Error/Interrupt Status Register 


po_err_int__en 


Config2 


Error/Interrupt Enable Register 


po_dmr 


Config2 


Data Manipulation Register 


po_subst 


Config2 


Substitution Value Register 


po_cdp 


Status 


Current Data Pointer 


po_len 


Control 1 


Length Register 


po_said 


Control 1 


Start Address or Immediate Data 


po_idr 


Controi2 


Image Dimensions Register 


po_muv_yalid 


Control2 


MUV valid bits 


po_muv 


Configl 


Base address of MUV RAM 


Operand Ori 


janizer B Registers 


oob_cfg 


Config2 


Configuration Register 


oob stat 


Status 


Status Register 


oob_err_int 


Interrupt 


Error/Interrupt Register 


oob_err_int_en 


Config2 


Error/Interrupt Enable Register 


oob dmr 


Config2 


Data Manipulation Register 


oob flubst 


Config2 


Substitution Value Register 


oob_cdp 


Status 


Current Data Pointer 


oob len 


Control 1 


Input Length Register 


oob_said 


Control 1 


Operand Start Address 


oob_tile 


Control 1 


Tiling length/offset Register 


Operand Organizer C Registers 


ooc_cfg 


Config2 


Configuration Register 


ooc_stat 


Status 


Status Register 


ooc_err_int 


Interrupt 


Error/Interrupt Register 


ooc_err_int_en 


Config2 


Error/Interrupt Enable Register 


ooc dmr 


Config2 


Data Manipulation Register 


ooc^subst 


Config2 


Substitution Value Register 


ooc_cdp 


Status 


Current Data Pointer 


ooc len 


Control 1 


Input Length Register 


ooc said 


Control 1 


Operand Start Address 
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NAME 


TYPE ! 


DESCRIPTION 


ooc_tile 


Control! 


Tiling length/offset Register 


JPEG Coder Register 


jc_cfg 


Config2 


configuration 


jc_stat 


Statue 


statue 


jc_err_int 


Interrupt 


error and interrupt status register 


jc_err_int_en 


Config2 


error and interrupt enable register 


jc_rsi 


Configl 


restart interval 


jc_decode 


Control2 ; 


decode of current instruction 


jcjres 


Control 1 


residual value 


jc_table_sel 


Control2 


table selection from decoded 
instruction 


Main Dat 


;a Path Register 


mdp_cfg 


Config2 


configuration 


mdp_stat 


Status 


status 


mdp_err_int 


Interrupt 




mdp_err_int_en 


Config2 


error/interrunt enahlp 


mdp_test 


Config2 


test modes 


mdp_opl 


Control2 


current operation 1 


mdp_op2 


Control2 


current operation 2 


mdp_por 


Control 1 


offset for plus operator 


mdp_bi 


Controll 


blend start/offset to index table entry 


mdp__bm 


Control 1 


blend end or number of rows and 
columns in matrix, binary places, and 
number of levels in halftoning 


mdp Jen 


Controll 


Length of blend to produce 


Result Organizer Register 


ro_cfg 


Config2 


Configuration Register 


ro stat 


Status 


Status Register 


ro_err_int 


Interrupt 


Error/Interrupt Register 


ro_err_int_en 


Config2 


Error/Interrupt Enable Register 


ro_dmr 


Config2 


Data Manipulation Register 


ro_subst 


Configl 


Substitution Value Register 


ro_cdp 


Status 


Current Data Pointer 


ro len 


Status 


Output Length Register 


ro sa 


Configl 


Start Address 


ro idr 


Configl 


Image Dimensions Register 
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NAME 


TYPE 


DESCRIPTION 


ro_ybase 


Configl 


co-processor Virtual Base Address 


ro_cut 


Configl 


Output Cut Register 


ro_lmt 


Configl 


Output Length Limit 


PCIBus Confi 


guration Space alias 






A read only copy of PCI configuration 
space registers 0x0 to OxD and OxF. 


dcl external cfe 


Status 


3 2 -bit field downloaded at reset from 
an external serial ROM. Has no 
influence on coprocessor operation. 


111 


put Interface Switch Registers 


iis_cfje[ 


Config2 


Configuration Register 


iis_stat 


Status 


Status Register 


iis_err_int 


Interrupt 


Interrupt/Error Status Register 


iifi_Grr_int_en 


Config2 


Interrupt/Error Enable Register 


iie_ic_addr 


Status 


Input address from IC 


iis_doc_addr 


Status 


Input address from DCC 


iis_po_addr 


Status 


Input address from PO 


iis_burst 


Status 


Burst Length from PO, DCC & IC 


iis_base_addr 


Configl 


Base address of co-processor memory 
object in host memory map. 


iis test 


Configl 


Test mode register 



The more notable ones of these registers include: 

(a) Instruction Pointer Registers (ic_ipa andic_ipb). This pair of 
registers each contains the virtual address of the currently executing 
instruction. Instructions are fetched from ascending virtual addresses and 
executed. Jump instruction can be used to transfer control across non- 
contiguous virtual addresses. Associated with each instruction is a 32 bit 
sequence number which increments by one per instruction. The sequence 
numbers are used by both the co-processor 224 and by the host CPU 202 to 
synchronize instruction generation and execution. 

(b) Finished Registers (ic_fna and icjiib). This pair of registers 
each contains a sequence number counting completed instructions. 

(c) Todo Register (ic_tda and ic_tdb). This pair of registers each 
contains a sequence number counting queued instructions. 
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(d) Interrupt Register (icjnta and icjntb). This pair of registers 
each contains a sequence number at which to interrupt. 

(e) Interrupt Status Registers (icj3tat.a_j>rimed and 
ic_stat.b_primed). This pair of registers each contains a primed bit which is 
a flag enabling the interrupt following a match of the Interrupt and Finished 
Registers. This bit appears alongside other interrupt enable bits and other 
status/configuration information in the Interrupt Status (icjstat) register. 

(f) Register Access Semaphores (ic_sema and ic_semb). The 
host CPU 202 must obtain this semaphore before attempting register 
accesses to the co-processor 224 that requires atomicity, ie. more than one 
register write. Any register accesses not requiring atomicity can be 
performed at any time. A side effect of the host CPU 202 obtaining this 
semaphore is that co-processor execution pauses once the currently executing 
instruction has completed. The Register Access Semaphore is implemented 
as one bit of the configuration7status register of the co-processor 224, These 
registers are stored in the Instruction Controllers own register area. As 
noted previously, each sub-module of the co-processor has its own set of 
configuration and status registers. These registers are set in the course of 
regular instruction execution. All of these registers appear in the register 
map and many are modified implicitly as part of instruction execution. 
These are all visible to the host via the register map. 

3.4 Format of Plural Streams 

As noted previously, the co-processor 224, in order to maximize the 
utilization of its resources and to provide for rapid output on any external 
peripheral device, executes one of two independent instruction streams. 
Typically, one instruction stream is associated with a current output page 
required by an output device in a timely manner, while the second instruction 
stream utilizes the modules of the co-prooessor 224 when the other instruction 
stream is dormant. Clearly, the overriding imperatives are to provide the 
required output data in a timely manner whilst simultaneously attempting to 
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maximize the use of resources for the preparation of subsequent pages, bands, 
etc. The co-processor 224 is therefore designed to execute two completely 
independent but identically implemented instruction streams (hereafter 
termed A and B). The instructions are preferably generated by software 
running on the host CPU 202 (Fig. 1) and forwarded to the raster image 
acceleration card 220 for execution by the co-processor 224. One of the 
instruction streams (stream A) operates at a higher priority than the other 
instruction stream (stream B) during normal operation. The stream or 
queue of instructions is written into a buffer or list of buffers within the host 
RAM 203 (Fig. 1) by the host CPU 202. The buffers are allocated at start-up 
time and locked into the physical memory of the host 203 for the duration of 
the application. Each instruction is preferably stored in the virtual memory 
environment of the host RAM 203 and the raster image co-processor 224 
utilizes a virtual to physical address translation scheme to determine a 
corresponding physical address with the in-host RAM 203 for the location of a 
next instruction. These instructions may alternatively be stored in the co- 
processors 224 local memory. 

Turning now to Fig. 8, there is illustrated the format of two instruction 
streams A and B 270, 271 which are stored within the host RAM 203. The 
format of each of the streams A and B is substantially identical. 

Briefly, the execution model for the co-processor 224 consists of: 

• Two virtual streams of instructions, the A stream and the B 

stream. 

• In general only one instruction is executed at a time. 

• Either stream can have priority, or priority can be by way of 
"round robin". 

• Either stream can be locked" in, ie. guaranteed to be executed 
regardless of stream priorities or availability of instructions on the other 
stream. 
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• Either stream can be empty. 

• Either stream can be disabled. 

• Either stream can contain instructions that can be 
"overlapped", ie. execution of the instruction can be overlapped with that of 
the following instruction if the following instruction is not also "overlapped". 

• Each instruction has a "unique" 32 bit incrementing sequence 
numbQr. 

• Each instruction can be coded to cause an interrupt, and/or a 
pause in instruction execution. 

• Instructions can be speculatively prefetched to minimize the 
impact of external interface latency. 

The instruction controller 235 is responsible for implementing the co- 
processor's instruction execution model maintaining overall executive control 
of the co-processor 224 and fetching instructions from the host RAM 203 when 
required. On a per instruction basis, the instruction controller 235 carries 
out the instruction decoding and configures the various registers within the 
modules via CBus 231 to force the corresponding modules to carry-out that 
instruction. 

Turning now to Fig. 9, there is illustrated a simplified form of the 
instruction execution cycle carried out by the instructions controller 235. The 
instruction execution cycle consists of four main stages 276-279. The first 
stage 276 is to determine if an instruction is pending on any instruction 
stream. If this is the case, an instruction is fetched 277, decoded and 
executed 278 by means of updating registers 279. 
3.5 Determine Current Active Stream 

In implementing the first stage 276, there are two steps which must 
be taken: 

1. Determine whether an instruction is pending; and 

2. Decide which stream of instructions should be fetched next. 
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In determining whether instructions are pending the following 
possible conditions must be examined: 

1. whether the instruction controller is enabled; 

2. whether the instruction controller is paused due to an internal 
error or interrup t; 

3. whether there is any external error condition pending; 

4. whether either of the A or B streams are locked; 

5. whether either stream sequence numbering is enabled; and 

6. whether either stream contains a pending instruction. 

The following pseudo code describes the algorithm for determining 
whether an instruction is pending in accordance with the above rules. This 
algorithm can be hardware implemented via a state transition machine 
within the instruction controller 235 in known manner: 

if not error and enabled and not bypassed and not self test mode 
if A stream locked and not paused 

if A stream enabled and (A stream 
sequencing disabled or instruction on A stream) 

instruction pending 
else 

no instruction pending 
end if 

else 

if B stream lockQd and not paused 

if B stream enabled and (B stream 
sequencing disabled or instruction on B stream) 

instruction pending 
else 

no instruction pending 
end if 
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olsc f* no stream is locked V 

if (A stream enabled and not paused 
and (A stream sequencing disabled or instruction on A stream)) 

or(B stream enabled and not paused 
and (B stream sequencing disabled or instruction on B stream)) 

instruction pending 
else 

no instruction pending 
end if 

end if 

else I* interface controller not enabled */ 

no instruction pending 

end if 



If no instruction is found pending, then the instruction controller 235 
will "spin" or idle until a pending instruction is found. 

To determine which stream is "active", and which stream is executed 
next, the following possible conditions are examined: 

1. whether either stream is locked; 

2. what priority is given to the A and B streams and what the 
last instruction stream was; 

3. whether either stream is enabled; and 

4. .whether either stream contains a pending instruction. 

The following pseudo code implemented by the instruction controller 
describes how to determine the next active instruction stream: 



if A stream locked 

next stream is A 

else ifB stream locked 
next stream is B 
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£fee f*no stream is locked V 

if (A stream enabled and (A stream sequencing disabled or 
instruction on A stream)) and not (B stream enabled and (B stream 
sequencing disabled or instruction on B stream)) 

next stream is A 
else if(B stream enabled and (B stream sequencing disabled 
or instruction on B stream)) and not (A stream enabled and (A stream 
sequencing disabled or instruction on A stream)) 

next stream is B 
eise /* both stream have instruction 7 

ifpri =0 /*A high, B low V 

next stream is A 
elseifpri = l /* A low, B high */ 

next stream is B 
else ifpri = 2 or 3 /* round robin */ 
if last stream is A 
next stream is B 

else 

next stream is A 
end if 
end if 

end if 

end if 

As the conditions can be constantly changing, all conditions must be 
determined together atomicaily. 

3.6 Fetch Instruction of Current Active Stream 

After the next active instruction stream is determined, the Instruction 
Controller 235 fetches the instruction using the address in the corresponding 
instruction pointer register (icjpa or iejpb). However, the Instruction 
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Controller 235 does not fetch an instruction if a valid instruction already 
exists in a prefetch buffer stored within the instruction controller 235. 
A valid instruction is in the prefetch buffer if: 

1. the prefetch buffer is valid; and 

2. the instruction in the prefetch buffer is from the same stream 
as the currently active stream. 

The validity of the contents of the prefetch buffer is indicated by a 
prefetch bit in the ic_stat register, which is set on a successful instruction 
prefetch. Any external write to any of the registers of the instruction 
controller 235 causes the contents of the prefetch buffer to be invalidated. 
3.7. Decode and Execute Instruction 

Once an instruction has been fetched and accepted the instruction 
controller 235 decodes it and configures the registers 229 of the eo-prooessor 
224 to execute the instruction. 

The instruction format utilized by the raster image co-processor 224 
differs from traditional processor instruction sets in that the instruction 
generation must be carried out instruction by instruction by the host CPU 202 
and as such is a direct overhead for the host. Further, the instructions 
should be as small as possible as they must be stored in host RAM 203 and 
transferred over the PCI bus 206 of Fig. 1 to the co-processor 224. Preferably, 
the co-processor 224 can be set up for operation with only one instruction. As 
much flexibility as possible should be maintained by the instruction set to 
maximize the scope of any future changes. Further, preferably any 
instruction executed by the co-processor 224 applies to a long stream of 
operand data to thereby achieve best performance. The co-processor 224 
employs an instruction decoding philosophy designed to facilitate simple and 
fast decoding for "typical instructions" yet still enable the host system to 
apply a finer control over the operation of the co-processor 224 for "atypical" 
operations. 
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Turning now to Fig. 10, there is illustrated the format of a single 
instruction 280 which comprizes eight words each of 32 bits. Each 
instruction includes an instruction word or opcode 281, and an operand or 
result type data word 282 setting out the format of the operands. The 
addresses 283-285 of three operands A, B and C are also provided, in addition 
to a result address 286. Further, an area 287 is provided for use by the host 
CPU 202 for storing information relevant to the instruction. 

The structure 290 of an instruction opcode 281 of an instruction is 
illustrated in Fig. 11. The instruction opcode is 32 bits long and includes a 
major opcode 291, a minor opcode 292, an interrupt (I) bit 293, a partial 
decode (Pd) bit 294, a register length (R) bit 295, a lock (L) bit 296 and a 
length 297. A description of the fields in the instruction word 290 is as 
provided by the following table. 
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Table 2: Opcode Description 



Field 


Description 1 


major opcode [3..0] 


Instruction category 
0: Reserved 

1: General Colour Space Conversion 

2: JPEG Compression and Decompression 

3: Matrix Multiplication 

4: Image Convolutions 

5: Image Transformations 

6: Data Coding 

7: Halftone 

8: Hierarchial image decompression 
9: Memory Copy 

10: Internal Register and Memory Access 
11: Instruction Flow Control 
12: Compositing 
13: Compositing 

14: Reserved j 
15: Reserved 


minor opcode 
I7..0] 


Instruction detail. The coding of this field is 
dependent on the major opcode. 


I 


1 = Interrupt and pause when competed, 
0= Don't interrupt and pause when 
completed 




Field 


Description 


Pd 


Partial Decode 

1 = use the "partial decode" mechanism. 

0 = Don't use the "partial decode" 

mechanism 


R 


1 = length of instruction is specified by the 
Pixel Organizer's input length register (pojen) 
0 = length of instruction is specified by the 
opcode length field. 


L 


1 = this instruction stream (A or B) is 
"locked" for the next instruction. 
0 = this instruction stream (A or B) is not 
Mocked" in for the next instruction. 


length [15,.0] 


number of data items to read or generate || 



By way of discussion of the various fields of an opcode, by setting the I- 
5 bit field 293, the instruction can be coded such that instruction execution sets 
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an interrupt and pause on completion of that instruction. This interrupt is 
called an "instruction completed interrupt". The partial decode bit 294 
provides for a partial decode mechanism such that when the bit is set and 
also enabled in the ic_cfg register, the various modules can be micro coded 
prior to the execution of the instruction in a manner which will be explained in 
more detail hereinafter. The lock bit 296 can be utilized for operations which 
require more than one instruction to set up. This can involve setting various 
registers prior to an instruction and provides the ability to "lock" in the 
current instruction stream for the next instruction. When the L-bit 296 is set, 
once an instruction is completed, the next instruction is fetched from the same 
stream. The length field 297 has a natural definition for each instruction 
and is defined in terms of the number of "input data items" or the number of 
"output data items" as required. The length field 297 is only 16 bits long. 
For instructions operating on a stream of input data items greater than 
64,000 items the R-bit 295 can be set, in which case the input length is taken 
from a pojen register within the pixel organizer 246 of Fig. 2. This register 
is set immediately before such an instruction. 

Returning to Fig. 10, the number of operands 283-286 required for a 
given instruction varies somewhat depending on the type of instruction 
utilized. The following table sets out the number of operands and length 
definition for each instruction type: 
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Table 3: Operand Types 



Instruction 
Class 


Length defined by 


#of 
operand 

3 


Compositing 


mnnt 


pixeis 


8 


General Color Space Conversion 




fl 1YQI O 


2 


JPEG decompression/compression 


input 


bytes 


2 


other decompression/compression 


input 


bytes 


2 


Image Transformations and 
Convolutions 


output 


bytes 


2 


Matrix Multiplication 


input 


pixels 


2 


Halftoning 


input 


pixels, bytes 


2 


Memory Copying 


input 


pixels, bytes 


1 


Hierarchial Image Decompression 


input 


pixels, bytes 


1 or 2 


Flow Control 


fixed 


fixed 


2 


Internal Access Instructions 


fixed 


fixed 


4 



Turning now to Fig. 12, there is illustrated, firstly, the data word 
format 300 of the data word or operand descriptor 282 of Fig. 10 for three 
operand instructions and, secondly, the data word format 301 for two operand 
instructions. The details of the encoding of the operand descriptors axe 
provided in the following table: 



Table 4: Operand Descriptors 



| Field 


Description 


what 


0 - instruction specific mode: 

This indicates that the remaining fields of the descriptor 
will be interpreted in line with the major opcode. 
Instruction specific modes supported are: 
majoT opcode = 0-11: Reserved 

major opcode = 1243: (Compositing): Implies that 
Operand C is a bitmap attenuation. The occ_dmr 
register will be set appropriately, with the cc=l and 
normalizo=0 

major opcode = 14-15: Reserved 

1 = sequential addressing 

2 = tile addressing 

3 = constant data 
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Field 


Description 


L 

. 


0 = not long: immediate data 

1 = long: pointer to data 


if 


internal format: 

0 = pixels 

1 = unpacked bytes 

2 = packed bytes 

3 = other 


S 


0 = set up Data Manipulation Register as appropriate for 
this operand 

1 = use the Data Manipulation Register as is 


c 


0 = not cacheable 

1 = cacheable 

Note: In general a performance gain will be achieved if an 
operand is specified as cacheable. Even operands 
displaying low levels of referencing localitv (such as 
sequential data) still benefit from being cached - as it 
allows data to be burst transferred to the host processor 
and is more efficient. 


p 


external format: 

0 = unpacked bytes 

1 = packed stream 


bo[2:0] 


bit offset. Specifies the offset within a byte of the start of 
bitwize data. 


R 


0 = Operand C does not describe a register to set. 

1 = Operand C describes a register to set. 

This bit is only relevant for instructions with less than 
three operands. j 



With reference to the above table, it should be noted that, firstly, in 
respect of the constant data addressing mode, the co-processor 224 is set up to 
fetch, or otherwize calculate, one internal data item, and use this item for the 
length of the instruction for that operand. In the tile addressing mode, the 
co-processor 224 is set up to cycle through a small set of data producing a 
"tiling effect". When the L-bit of an operand descriptor is zero then the data 
is immediate, ie. the data items appear literally in the operand word. 
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Returning again to Fig. 10, each of the operand and result words 283- 
286 contains either the value of the operand itself or a 32-bit virtual address 
to the start of the operand or result where data is to be found or stored. 

The instruction controller 235 of Fig. 2 proceeds to decode the 
instruction in two stages. It first checks to see whether the major opcode of 
the instruction is valid, raising an error if the major opcode 291 (Fig 11) is 
invalid. Next, the instruction is executed by the instruction controller 235 by 
means of setting the various registers via CBus 231 to reflect the operation 
specified by the instruction. Some instructions can require no registers to be 
set. 

The registers for each module can be classified into types based on 
their behavior. Firstly, there is the status register type which is "read only" 
by other modules and "read/write" by the module including the register. Next, 
a first type of configuration register, hereinafter called "configl", is 
"readywrite" externally by the modules and "read only" by the module 
including the register. These registers are normally used for holding larger 
type configuration information, such as address values. A second type of 
configuration register, herein known as ,, config2\ is readable and writable by 
any module but is read only by the module including the register. This type 
of register is utilized where bit by bit addressing of the register ia required. 

A number of control type registers are provided. A first type, 
hereinafter known as "controll" registers, is readable and writable by all 
modules (including the module which includes the register). The controll 
registers are utilized for holding large control information such as address 
values. Analogously, there is further provided a second type of control 
register, hereinafter known as M control2 ,r , which can be set on a bit by bit 
basis, 

A final type of register known as an interrupt register has bits within 
the register which are settable to 1 by the module including the register and 
resettable to zero externally by writing a "1" to the bit that has been set. 
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This type of register is utilized for dealing with the interrupts/errors flagged 
by each of the modules. 

Each of the modules of the co-processor 224 sets a c_active line on the 
CBufi 231 when it is busy executingan instruction. The instruction controller 
235 can then determine when instructions have been completed by "OR-ing" 
the c_active lines coming from each of the modules over the CBus 231. The 
local memory controller module 236 and the peripheral interface controller 
module 237 are able to execute overlapped instructions and include a 
cbackground line which is activated when they axe executing an overlapped 
instruction. The overlapped instructions are "local DMA" instructions 
transferring data between the local memory interface and the peripheral 
interface. 

The execution cycle for an overlapped local DMA instruction is slightly 
different from the execution cycle of other instructions. If an overlapped 
instruction is encountered for execution, the instruction controller 235 checks 
whether there is already an overlapped instruction executing. If there is, or 
overlapping is disabled, the instruction controller 235 waits for that 
instruction to finish before proceeding with execution of that instruction. If 
there is not, and overlapping is enabled, the instruction controller 235 
immediately decodes the overlapped instruction and configures the peripheral 
interface controller 237 and local memory controller 236 to carry out the 
instruction. After the register configuration is completed, the instruction 
controller 235 then goes onto update its registers (including finished register, 
status register, instruction pointer, etc.) without waitingfor the instruction to 
"complete" in the conventional sense. At this moment, if the finished 
sequence number equals the interrupt sequence number, the overlapped 
instruction completed 1 interrupt is primed rather than raising the interrupt 
immediately. The 'overlapped instruction completed' interrupt is raized 
when the overlapped instruction has fully completed. 
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Once the instruction has been decoded, the instruction controller 
attempts to prefetch the next instruction while the current instruction is 
executing. Most instructions take considerably longer to execute than they 
will to fetch and decode. The instruction controller 235 prefetches an 
instruction if all of the following conditions are met: 

1. the currently executing instruction is not set to interrupt and 

pause; 

2. the currently executing instruction is not a jump instruction; 

3. the next instruction stream is prefetch-enabled; and 

4. there is another instruction pending. 

If the instruction controller 235 determines that prefetching is 
possible it requests the next instruction, places it in a prefetch buffer and 
then validates the buffer. At this point there is nothing more fur the 
instruction controller 235 to do until the currently executing instruction has 
completed. The instruction controller 235 determines the completion of an 
instruction by examining the c_active and c_background lines associated with 
the CBus 231. 

3,8 Update Registers of Instruction Controller 

Upon completion of an instruction, the instruction controller 235 
updates its registers to reflect the new state. This must be done atomicaily 
to avoid problems with synchronising with possible external accesses. This 
atomic update process involves: 

1. Obtaining the appropriate Register Access Semaphore. If the 
semaphore is taken by an agent external to the Instruction Controller 235, the 
instruction execution cycle waits at this point for the semaphore to be 
released before proceeding. 

2. Updating the appropriate registers. The instruction pointer 
(ic_ipa or icjpb) is incremented by the size of an instruction, unless the 
instruction was a successful jump, in which case the target value of the jump 
is loaded into the instruction pointer, 
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The finished register (ic_fna or iefnb), is then incremented if sequence 
numbering is enabled. 

The status register (ic_stat) is also updated appropriately to reflect 
the new state. This includes sotting the pause bite if necessary. The 
Instruction Controller 235 pauses if an interrupt has occurred and pausing is 
enabled for that interrupt or if any error has occurred. Pausing is 
implemented by setting the instruction stream pause bits in the status 
register (a_pause or b^pause bits in ic_stat). To resume instruction 
execution, these bits should be reset to 0. 

3. Asserting a c_end signal on the CBus 231 for one clock cycle, 
which indicates to other modules in the co-processor 224 that an instruction 
has been completed. 

4. Raising an interrupt if required. An interrupt iB raized if: 

a. "Sequence number completed" interrupt occurs. That is, if 
the finished register (icjha or icjhb) sequence number is the same as 
interrupt sequence number. Then this interrupt is primed, sequence 
numbering is enabled, and the interrupt occurs; or 

b. the just oompleted instruction was coded to interrupt on 
completion, then this mechanism is enabled. 

3.9 Semantics of the Register Access Semaphore 

The Register Access Semaphore is a mechanism that provides atomic 
accesses to multiple instruction controller registers. The registers that can 
require atomic access are as follows: 

1. Instruction pointer register (ic_ipa and icjipb) 

2. Todo registers (ic_tda and ic_tdb) 

3. Finished registers (icjha and ic_fnb) 

4. Interrupt registers (icjnta and ic_intb) 

5. The pause bits in the configuration register (ic_cfg) 
External agents can read all registers safely at any time. External 

agents are able to write any registers at any time, however to ensure that the 
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Instruction Controller 235 does not update values in these registers, the 
external agent must first obtain the Register Access Semaphore. The 
Instruction Controller does not attempt to update any values in the 
abovementioned registers if the Register Access Semaphore is claimed 
externally. The instruction controller 235 updates all of the above mentioned 
registers in one clock cycle to ensure atomicity. 

As mentioned above, unless the mechanism is disabled, each 
instruction has associated with it a 32 bit "sequence number". Instruction 
sequence numbers increment wrapping through from OxFFFFFFFF to 
0x00000000. 

When an external write ie made into one of the Interrupt Registers 
(icjnta or ic.intb), the instruction controller 235 immediately makes the 
following comparisons and updates: 

1 . If the interrupt sequence number (ie. the value in the Interrupt 
Register) is "greater" (in a modulo sense) than the finished sequence number 
(ie. the value in the Finished Register) of the same stream, the instruction 
controller primes the "sequence number completed" interrupt mechanism by 
setting the "sequence number completed" primed bit (a j>rimed or b_primed 
bit in ic_stat) in the status register. 

2. If the interrupt sequence number is not "greater" than the 
finished sequence number, but there is an overlapped instruction in progress 
in that stream and the interrupt sequence number equals the last overlapped 
instruction sequence number (ie. the value in the ic_loa or icjob register), 
then the instruction controller primes the "overlapped instruction sequence 
number completed" interrupt mechanism by setting the a_ol_primed or 
b_ol_primed bits in the ic_stat register. 

3. If the interrupt sequence number is not "greater" than the 
finished sequence number, and there is an overlapped instruction in progress 
in that stream, but the interrupt sequence number does not equal the last 
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overlapped instruction sequence number, then the interrupt sequence number 
represents a finished instruction, and no interrupt mechanism is primed. 

4. If the interrupt sequence number is not "greater" than the 
finished sequence number, and there is no overlapped instruction in progress 
in that stream, then the interrupt sequence number must represent a finished 
instruction, and no interrupt mechanism is primed. 

External agents can set any of the interrupt primed bits (bits 
a_primed, a_oU>rimed, berimed or b_ol_j)rimed) in the status register to 
activate or de-activate this interrupt mechanism independently. 
3.10 Instruction Controller 

Turning now to Fig. 13, there is illustrated the instruction controller 
235 in more detail. The instruction controller 235 includes an execution 
controller 305 which implements the instruction execution cycle as well as 
maintaining overall executive control of the co-processor 224. The functions 
of the execution controller 305 include maintaining overall executive cont rol of 
the instruction controller 235, determining instructing sequencing, instigating 
instruction fetching and prefetching, initiating instructing decoding and 
updating the instruction controller registers. The instruction controller 
further includes an instruction decoder 306. The instruction decoder 306 
accepts instructions from a prefetch buffer controller 307 and decodes them 
according the aforementioned description. The instruction decoder 306 is 
responsible for configuring registers in the other co-processor modules to 
execute the instruction. The prefetch buffer controller 307 manages the 
reading and writing to a prefetch buffer within the prefetch buffer controller 
and manages the interfacing between the instruction decoder 306 and the 
input interface switch 252 (Pig. 2). The prefetch buffer controller 307 is also 
responsible for managing the updating of the two instruction pointer registers 
(ic.ipa and ic_ipb). Access to the CBus 231 (Fig. 2) by the instruction 
controller 235, the miscellaneous module 239 (Fig. 2) and the external 
interface controller 238 (Fig. 2) is controlled by a "CBus" arbitrator 308 which 
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arbitrates between the three modules' request for access. The requests are 
transferred by^ means of a control bus (CBus) 231 to the register units of the 
various modules. 

Turning now to Fig. 14, there is illustrated the execution controller 
305 of Fig. 13 in more detail. As noted previously, the execution controller is 
responsible for implementing the instruction execution cycle 275 of Fig. 9 and, 
in particular, is responsible for: 

1- Determining which instruction stream the next instruction is 
to come from; 

2 . Initiating fetching of that instruction; 

3. Signalling the instruction decoder to decode the instruction as 
residing in the prefetch buffer; 

4. Determining and initiating any prefetching of the next 
instruction; 

5. Determining instruction completion; and 

6. Updating the registers after the instruction has completed. 
The execution controller includes a large core state machine 310 

hereinafter known as ,1 the central brain" which implements the overall 
instruction execution cycle. Turning to Fig. 15, there is illustrated the state 
machine diagram for the central brain 310 implementing the instruction 
execution cycle as aforementioned. Returning to Fig. 14, the execution 
controller includes an instruction prefetch logic unit 311. This unit is 
responsible for determining whether there is an outstanding instruction to be - 
executed and which instruction stream the instruction belongs to. The start 
312 and prefetch 313 states of the transition diagram of Fig. 15 utilize this 
information in obtaining instructions. A register management unit 317 of 
Fig. 14 is responsible for monitoring the register access semaphores on both 
instruction streams and updating all necessary registers in each module. 
The register management unit 317 is also responsible for comparing the 
finished register (icjha or ic_fnb) with the interrupt register (ic_inta or 
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ic_intb) to determine if a "sequence number completed" interrupt is due. The 
register management unit 317 is also responsible for interrupt priming. An 
overlapped instructions unit 318 is responsible for managing the finishing off 
of an overlapped instruction through management of the appropriate status 
bits in the ic_stat register. The execution controller also includes a decoder 
interface unit 319 for interfacing between the central brain 310 and the 
instruction decoder 306 of Fig. 13, 

Turning now to Fig. 16, there is illustrated the instruction decoder 306 
in more detail. The instruction decoder is responsible for configuring the co- 
processor to execute the instructions residing in the prefetch buffer. The 
instruction decoder 306 includes an instruction decoder sequencer 321 which 
comprizes one large state machines broken down into many smaller state 
machines. The instruction sequencer 321 communicates with a CBus 
dispatcher 312 which is responsible for setting the registers within each 
module. The instruction decoder sequencer 321 also communicates relevant 
information to the execution controller such as instruction validity and 
Libtruction overlap conditions. The instruction validity check being to check 
that the instruction opcode is not one of the reserved opcodes. 

Turning now to Fig. 17, there is illustrated, in more detail, the 
instruction dispatch sequencer 321 of Fig. 16. The instruction dispatch 
sequencer 321 includes a overall sequencing control state machine 324 and a 
series of per module configuration sequencer state machines, eg. 325, 326. 
One per module configuration sequencer state machine is provided for each 
module to be configured. Collectively the state machines implement the co- 
processor's microprogramming of the modules. The state machines, eg. 325, 
instruct the CBus dispatcher to utilize the global CBus to set various 
registers so as to configure the various modules for processing. A side effect 
of writing to particular registers is that the instruction execution commences. 
Instruction execution typically takes much longer than the time it takes for 
the sequencer 321 to configure the co-processor registers for execution. In 
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appendix A, attached to the present specification, there is disclosed the 
microprogramming operations performed by the instruction sequencer of the 
co-processor in addition to the form of set up by the instruction sequencer 321. 

ha practice, the Instruction Decode Sequencer 321 does not configure 
all of the modules within the co-processor for every instruction. The table 
below shows the ordering of module configuration for each class of instruction 
with the module configured including the pixel organizer 246 (PO), the data 
cache controller 240 (DCC), the operand organizer B 247 (OOB), the operand 
organizer C 248 (OOC), main data path 242 (MDP), results organizer 249 
(RO), and JPEG encoder 24 1 (JC). Some of the modules are never configured 
during the course of instruction decoding. These modules are the External 
Interface Controller 238 (EIC), the Local Memory Controller 236 (LMC), the 
Instruction Controller 235 itself (IC), the Input Interface Switch 252 (IIS) and 
the Miscellaneous Module (MM). 
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Table 5: Module Setup Order 



. . 

Instruction 
Class 


1 

Module Configuration 


Sequence 

ID 


Compositing 


PO, DCC, OOB, OOC, MDP. RO 


1 


CSC 


PO, DCC, OOB, OOC, MDP. RO 


2 


JPEG coding 


PO, DCC, OOB, OOC, JC, RO 


3 


Data coding 


PO, DCC, OOB, OOC, JC. RO 


3 


Transformations and 
Convolutions 


PO, DCC, OOB, OOC, MDP, RO 


2 


Matrix Multiplication 


PO, DCC, OOB, OOC, MDP. RO 


2 


Halftoning 


PO, DCC, OOB, MDP, RO 


4 


General memory copy 


PO, JC, RO 


8 


Peripheral DMA 


PIC 


D 


Hierarchial Image ■ 
Horizontal 

In fa^ninl *a f \i \t\ 

M.I.X, iJVrf JL yj \JLCL \j 1KJ 11 


PO DCC OOB OOP Mnp pn 


b 


Hierarchial Image - 
others 


PO. DCC, OOB, OOC, MDP, RO 


4 


Internal access 


RO, RO, RO. RO 




others 







Turning now to Fig. 17, each of the module configuration sequencers, 
eg. 325 is responsible for carrying out the required register access operations 
to configure the particular module. The overall sequencing control state 
machine 324 is responsible for overall operation of the module configuration 
sequencer in the aforementioned order. 

Referring now to Fig. 18, there is illustrated 330 the state transition 
diagram for the overall sequencing control unit which basically activates the 
relevant module configuration sequencer in accordance with the above table. 
Each of the modules configuration sequencers is responsible for controlling the 
CBus dispatcher to alter register details in order to set the various registers 
in operation of the modules. 



-336- 



63 



Turning now to Fig. 19, there is illustrated the prefetch buffer 
controller 307 of Fig. 13 in more detail. The prefetch buffer controller 
consists of a prefetch buffer 336 for the storage of a single co-processor 
instruction (six times 32 bit words). The prefetch buffer includes one write 
port controlled by a Bus sequencer 336 and one read port which provides data 
to the instruction decoder, execution controller and the instruction controller 
CBus interface. The Bus sequencer 336 is responsible for observing bus 
protocols in the connection of the prefetch buffer 335 to the input interface 
switch. An address manager unit 337 is also provided which deals with 
address generation for instruction fetching The address manager unit 337 
performs the functions of selecting one of ic.ipa or icjpb to place on the bus to 
the input interface switch, incrementing one of ic_ipa or ic_ipb based on which 
stream the last instructions was fetched from and channelling jump target 
addresses back to the ic_ipa and ic_ipb register. A PBC controller 339 
maintains overall control of the prefetched buffer controller 307. 
3.11 Description of a Modules Local Register File 

As illustrated in Fig. 13, each module, including the instruction 
controller module itself, has an internal set of registers 304 as previously 
defined in addition to a CBus interface controller 303 as illustrated in Fig. 20 
and which is responsible for receiving CBus requests and updating internal 
registers in light of those requests. The module is controlled by writing 
registers 304 within the module via a CBus interface 302. A CBus 
arbitrator 308 {Fig. 13) is responsible for determining which module of the 
instruction controller 235, the external interface controller or the 
miscellaneous module is able to control the CBus 309 for acting as a master 
of the CBus and for the writing or reading of registers. 

Fig. 20, illustrates, in more detail, the standard structure of a CBus 
interface 303 as utilized by each of the modules. The standard CBus 
interface 303 accepts read and write requests from the CBus 302 and includes 
a register file 304 which is utilized 341 and updated on 341 by the various 
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siibmodules within a module. Further, control lines 344 are provided for the 
updating of any submodule memory areas including reading of the memory 
areas. The standard CBus interface 303 acts as a destination on the CBus, 
accepting read and write requests for the register 304 and memory objects 
inside other submodules, 

A "c_reset" signal 345 gets every register inside the Standard CBus 
interface^ 103 to their default states. However, "preset" will not reset the 
state machine that controls the handshaking of signals between itself and the 
CBus Master, so even if "c_reset" is asserted in the middle of a CBus 
transaction, the transaction will still finish, with undefined effects. The 
"c_int n 347, "c_exp" 348 and "c_err" 349 signals are generated bom the 
content of a modules err_int and err_int_en registers by the following 
equations: 

c^err = ^ errorfi] ANDerr_mask[i] (i) 
erro(i]not reserved 



c ~ int = X interrupt^] AND int_ maskfi] (2) 

mterrupflijnot reserved 

c_ exp = £ exceptional AND exp_ mask[i] (3) 

[ijnot reserved 

The signals "c.sdatajn" 345 and "csvalidjn" are data and valid signals from the 
previous module in a daisy chain of modules. The signals "cj*data_out" and n c_svalid_out ,f 
350 are data and valid signals going to the next module in the daisy chain. 

The functionality of the Standard CBus interface 303 includes: 

1. register read/write handling 

2. memory area read/write handling 

3. test mode read/write handling 

4. aubmodule observe/update handling 
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3.12 Register Read/Write Handling 

The Standard CBus Interface 303 accepts register read/write and bit 
set requests that appears on the CBus. There are two types of CBus 
instructions that Standard CBus Interface handles: 

1. Type A 

Type A operations allow other modules to read or write 1, 2, 3, or 4 
bytes into any register inside Standard CBus Interface 303. For write 
operations, the data cycle occurs in the clock cycle immediately after the 
instruction cycle. Note that the type field for register write and read are 
"10D0" and "1001" respectively. The Standard CBus Interface 303 decodes 
the instruction to check whether the instruction is addressed to the module, 
and whether it is a read or write operation. For read operation, the Standard 
CBus Interface 303 usee the "reg" field of the CBus transaction to select which 
register output is to put into the "c_sdata M bus 350. For write operations, the 
Standard CBus Interface 303 uses the "reg" and "byte" fields to write the data 
into the selected register. After read operation is completed, the Standard 
CBus Interface returns the data and asserts "c^svalid" 350 at the same time. 
After write operations are completed, the Standard CBus Interface 303 
asserts M c_svalid M 350 to acknowledge. 

2. Type C 

Type C operations allow other modules to write one or more bits in one 
of the bytes in one of the registers. Instruction and data are packed into one 
word. 

The Standard CBus Interface 303 decodes the instruction to check 
whether the instruction is addressed to the module. It also decodes M reg", 
"byte" and "enable" fields to generate the required enable signals. It also 
latches the data field of the instruction, and distributes it to all four bytes of a 
word so the required bit(s) are written in every enabled bit(s) in every enabled 
byte(s). No acknowledgment is required for this operation. 
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3.13 Memory Area Read /Write Handling 

The Standard CBus Interface 303 accepts memory read and memory 
write requests that appears on the CBus. While accepting a memoiy 
read/write request, the Standard CBus Interface 303 checks whether the 
request is addressed to the module. Then, by decoding the address field in 
the instruction, the Standard CBus Interface generates the appropriate 
address and address strobe signals 344 to the submodule which a memoiy 
read/write operation is addressed to. For write operations the Standard 
CBus Interface also passes on the byte enable signals from the instruction to 
the sub modules. 

The operation of the standard CBus interface 303 is controlled by a 
read/write controller 352 which decodes the type field of a CBus instruction 
from the CBus 302 and generates the appropriate enable signals to the 
register file 304 and output selector 353 so that the data is latched on the 
next cycle into the register file 304 or forwarded to other submodules 344. If 
the CBus instruction is a register read operation, the read/write controller 352 
enables the output selector 353 to select the correct register output going onto 
the "c_sdata bus" 345, If the instruction is a register write operation, the 
read/write controller 352 enables the register file 304 to select the data in the 
next cycle. If the instruction is a memory area read or write, then the 
read/write controller 352 generates the appropriate signals 344 to control 
those memory areas under a modules control. The register file 304 contains 
four parts, being a register select decoder 355, an output selector 353, 
interrupt 356, error 357 and exception 358 generators, unmasked error 
generator 359 and the register components 360 which make up the registers of 
that particular module. The register select decoder 355 decodes the signal 
"ref_en" (register file enable), "write" and "reg" from the read/write controller 
352 and generates the register enable signals for enabling the particular 
register of interest. The output selector 353 selects the correct register data 
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to be output on c_sdata_out lines 350 for register read operations according to 
the signal "reg M output from the read/write controller 352. 

The exception generators 356-359 generate an output error signal, eg. 
347-349, 362 when an error is detected on their inputs. The formula for 
calculating each output error is as aforementioned. 

The register components 360 can be defined to be of a number of types 
in accordance with requirements as previously discussed when describing the 
structure of the register set with reference to Table 5. 
3,14 CBus Structure 

As noted previously, the CBus (control bus) is responsible for the 
overall control of each module by way transferring information for the setting 
of registers within each module's standard CBus interface. It will be evident 
from the description of the standard CBus interface that the CBus serves two 
main purposes: 

1. It is the control bus that drives each of the modules. 

2. It is the access bus for RAMs, FIFOs and status information 
contained within each of the modules. 

The CBus uses an instruction-address-data protocol to control 
modules by the setting configuration registers within the modules. In 
general, registers will be set on a per instruction basis but can be modified at 
anytime. The CBus gathers status and other information, and accesses RAM 
and FIFO data from the various modules by requesting data. 

The CBus is driven on a transaction by transaction basis either by: 

1. the Instruction Controller 235 (Fig. 2) when executing 
instructions, 

2. the External Interface Controller 238 (Fig. 2) when performing 
a target (slave) mode bus operation, or 

3. an external device if the External CBus Interface is so 
configured. 
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In each of these cases, the driving module is considered to be the 
source module of the CBus, and all other modules possible destinations. 
Arbitration on this bus is carried out by the Instruction Controller. 

The following table sets out one form of CBus signal definitions 
suitable for use with the preferred embodiment: 



Table 6: CBua Signal Definition 



Name 

c_iad[31:0] 


source 


Definition 

instruction-address- 
data 


c_yalid 


source 


CBus instruction valid 


c_sdataf31:0] 


destination 


status/read data 


c_svalid 




status/read data valid 


c_reset[15:0] 


source 


reset lines to each 
module 


c_active[15:0] 


destination 


active lines from each 
module 


c_background[15:0] 


destination 


background active 
lines from each 
module 


c_int[16:0] 


destination 


interrupt lines from 
each module 


c_error[15:0] 


destination 


error lines from each 
module 


cjreql, c_req2 


EIC, external 


bus control request 



j Name 


Type 


Definition 1 


c_gntl, c_gnt2 


IC 


bus control grant 


c_end 


IC 


end of instruction 


elk 


global 


clock 
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A CBus cjad signal contains the addressing data and is driven by the 
controller in two distinct cycles: 

1. Instruction cycles (c_valid high) where the CBus instruction 
and an address is driven onto c_iad; and 

2. Data cycles (c.valid low) where data is driven onto c_iad (write 
operations) or c_sdata (read operations). 

ha the case of a write operation, the data associated with an 
instruction is placed on the c_iad bus in the cycle directly following the 
instruction cycle, ha the case of a read operation, the target module of the 
) read operation drives the c_sdata signal until the data cycle completes. 

Turning now to Fig. 21, the bus includes a 32 bit instruction-address- 
data field which can be one of three types 370-372: 

1. Type A operations (370) are used to read and write registers 
and the per-module data areas within the co-processor. These operations can 
be generated by the external interface controller 238 performing target mode 
PCI cycles, by the instruction controller 231 configuring the co-processor for an 
instruction, and by the External CBus Interface. 

For these operations, the data cycle occurs in the clock cycle 
immediately following the instruction cycle. The data cycle is acknowledged 
by the designation module using the c_svalid signaL 

2. Type B operations (371) are used for diagnostic purposes to 
access any local memory and to generate cycles on the. Generic Interface. 
These operations will be generated by the External Interface Controller 
performing target mode PCI cycles and by the External CBus Interface. The 
data cycle can follow at any time after the instruction cycle. The data cycle is 
acknowledged by the destination module using the c.svalid signal. 

3- Type C operations (372) are used to set individual bits within 
a module's registers. These operations will be generated by the instruction 
controller 231 configuring the co-processor's for an instruction and by the 
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External CBus Interface. There is no data cycle associated with a Type C 
operation, data is encoded in the instruction cycle. 

The type field of each instruction encodes the relevant CBus 
transaction type in accordance with the following table: 



Table 7: CBus Transaction Types 



c_iad.type 
value 


transaction type 


instruction 
x\jl iiacl u type 


0000 


no-op 


A, a, C 


0001 


reserved 




0010 


peripheral interface write 


B 


UU11 


peripheral interface read 


B 


0100 


generic bus write 


B 


0101 


generic bus read 


B 


0110 


local memory write 


B 


0111 


local memory read 


B 


1000 


register write 


A 


1001 


register read 


A 


1010 


module memory write 


A 


1011 


module memory read 


A 


1100 


test mode write 


A 


1101 


lest mode read 


A 


liio T 


bit set 


C 


1111 


reserved 





The byte field is utilized for enabling bits within a register to be set. 
The module field sets out the particular module to which an instruction on the 
CBus is addressed. The register field sets out which of the registers within a 
module is to be updated. The address field is utilized for addressing memory 
portions where an operation is desired on those memory portions and can be 
utilized for addressing RAMs, FIFOs, etc. The enable field enables selected 
bits within a selected byte when a bit set instruction is utilized. The data 
field contains the bit wize data of the bits to be written to the byte selected for 
update. 
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As noted previously, the CBus includes a c.aetive line for each module, 
which is asserted when ever a module has outstanding activity pending. The 
instruction controller utilizes these signals to determine when an instruction 
has completed. Further, the CBus contains a c.background line for each 
module that can operate in a background mode in addition to any preset, error 
and interrupt lines, one for each module, for resetting, detecting errors and 
interrupts. 

3.15 Co-processor Data Types and Data Manipulation 

Returning now to Fig. 2, in order to substantially simplify the 
operation of the co-processor unit 224, and in particular the operation of the 
major computational units within the oo-processor being the JPEG coder 241 
and the main data path 242, the co-processor utilizes a data model that 
differentiates between external formats and internal formats. The external 
data formats are the formats of data as it appears on the co-processor's 
external interfaces such as the local memory interface or the PCI bus. 
Conversely, the internal data formats are the formats which appear between 
the main functional modules of the co-processor 224. This is illustrated 
schematically in Pig. 22 which shows the various input and output formats. 
The input external format 381 ig the format which is input to the pixel 
organizer 246, the operand organizer B 247 and the operand organizer C 248. 
These organizers are responsible for reformatting the input external format 
data into any of a number of input internal formats 382, which may be 
inputted to the JPEG coder unit 241 and the main data path unit 242. These 
two functional units output data in any of a number of output internal 
formats 383, which are converted by the results organizer 249 to any of a 
number of required output formats 304. 

In the embodiment shown, the external data formats can be divided 
into three types. The fust type is a "packed stream" of data which consists of 
a contiguous stream of data having up to four channels per data quantum, 
with each channel consisting of one, two, four, eight or sixteen bit samples. 
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This packed stream can typically represent pixels, data to be turned into 
Pixels, or a stream of packed bits. The co-processor is designed to utilize 
little endian byte addressing and bigendian bit addressing within a byte. In 
Fig. 23, there is illustrated a first example 386 of the packed stream format. 
It is assumed that each object 387 is made up of three channels being channel 
0, channel 1 and channel 2, with two bits per channel. The layout of data for 
this format is as indicated 388. In a next example 390 of Fig. 24, a four 
channel object 395 having eight bits per channel is illustrated 396 with each 
data object taking up a 32 bit word. In a third example 395 of Pig. 25, one 
channel objects 396 are illustrated which each take up eight bits per channel 
starting at a bit address 397. Naturally, the actual width and number of 
channels of data will vary depending upon the particular application involved. 

A second type of external data format is the "unpacked byte stream" 
which consists of a sequence of 32 bit words, exactly one byte within each word 
being valid. An example of this format is shown in Fig. 26 and designated 
399, in which a single byte 400 is utilized within each word. 

A further external data format is represented by the objects classified 
as an "other" format. Typically, these data objects are large table-type data 
representing information such as colour space conversion tables, Huffman 
coding tables and the like. 

The co-processor utilizes four different internal data types. A first 
type is known as a "packed bytes" format which comprizes 32 bit words, each 
consisting of four active bytes, except perhaps for a final 32 bit word. In Fig. 
27, there is illustrated one particular example 402 of the packed byte format 
with 4 bytes per word. 

The next data type, illustrated with reference to Fig. 28, is "pixel" 
format and comprises 32 bit words 403, consisting of four active byte channels. 
This pixel format is interpreted as four channel data. 

A next interna] data type illustrated with reference to Fig. 29 is an 
"unpacked byte" format, in which each word consists of one active byte channel 
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405 and three inactive byte channels, the active byte channel being the least 
significant byte. 

All other internal data objects are classified by the "other" data 

format. 

Input data in a given external format is converted to the appropriate 
internal format. Fig. 30 illustrates the possible conversions carried out by 
the various organizers from an external format 410 to an internal format 411. 
Similarly, Fig. 31 illustrates the conversions carried out by the results 
organizer 249 in the conversion from internal formats 412 to external formats 
413. 

The circuitry to enable the following conversions to take place are 
described in greater detail below. 

Turning firstly to the conversion of input data external formats to 
internal formats, in Fig. 32 there is shown the methodology utilized by the 
various organizers in the conversion process. Starting initially with the 
external other format 416, this is merely passed through the various 
organizers unchanged. Next, the external unpacked byte format 417 
undergoes unpacked normalization 418 to produce a format 419 known as 
internally unpacked bytes. The process of unpacked normalization 418 
involves discarding the three inactive bytes from an externally unpacked byte 
stream. The process of unpacked normalization is illustrated in Fig. 33 
wherein the input data 417 having four byte channels wherein only one byte 
channel is valid results in the output format 419 which merely comprizes the 
bytes themselves. 

Turning again to Fig. 32, the process of packed normalization 421 
involves translating each component object in an externally packed stream 
422 into a byte stream 423. If each component of a channel is less than a 
byte in size then the samples are interpolated up to eight bit values. For 
example, when translating four bit quantities to byte quantities, the four bit 
quantity OxN is translated to the byte value OxNN. Objects larger than one 
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byte are truncated. The input object sizes supported on the stream 422 are 1, 
2, 4, 8 and 16 bit sizes, although again these may be different depending upon 
the total width of the data objects and words in any particular system to 
which the invention is applied. 

Turning now to Pig. 34, there is illustrated one form of packed 
normalization 421 on input data 422 which is in the form of 3 channel objects 
with two bits per channel (as per the data format 386 of Fig. 23). The output 
data comprizes a byte channel format 423 with each channel "interpolated 
up" where necessary to comprize an eight bit sample. 

Returning to Fig. 32, the pixel streams are then subjected to either a 
pack operation 425, an unpacked operation 426 or a component selection 
operation 427. 

Li Fig. 35 there is shown an example of the packed operation 425 
which simply involves discarding the inactive byte channel and producing a 
byte stream, packed up with four active bytes per word. Hence, a single valid 
byte stream 430 is compressed into a format 431 havingfour active bytes per 
word. The unpacking operation 426 involves almost the reverse of the 
packing operation with the unpacked bytes being placed in the least 
significant byte of a word. This is illustrated in Fig. 36 wherein a packed 
byte stream 433 is unpacked to produce result 434. 

The process of component selection 427 is illustrated in Fig. 37 and 
involves selecting N components from an input stream, where N is the number 
of input channels per quantum. The unpacking process can be utilized to 
produce "prototype pixels" eg. 437, with the pixel channels filled from the 
least significant byte. Turning to Fig. 38, there is illustrated an example of 
component selection 440 wherein input data in the form 436 is transformed 
by the component selection unit 427 to produce prototype pixel format 437. 

After component selection, a process of component substitution 440 
(Fig. 32) can be utilized. The component substitution process 440 is 
illustrated in Fig. 38 and comprizes replacing selected components with a 
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constant data value stored within aninternal data register 441 to produce, as 
an example, output components 242. 

Returning again to Pig. 32, the output of stages 425, 426 and 440 is 
subjected to a lane swapping process 444. The lane swapping process, as 
illustrated in Fig. 39, involves a byte-wize multiplexing of any lane to any 
other lane, including the replication of a first lane onto a second lane. The 
particular example illustrated in Fig. 39 includes the replacement of channel 
3 with channel 1 and the replication of channel 3 to channels 2 and channel 1. 

Returning again to Fig. 32, after the lane swapping step 444 the data 
stream can be optionally stored in the multi-used value RAM 250 before 
being read back and subjected to a replication process 446. 

The replication process 446 simply replicates the data object 
whatever it may be. In Fig. 40, there is illustrated a process of replication 
446 as applied to pixel data. In this case, the replication factor is one. 

In Fig. 41, there is illustrated a similar example of the process of 
replication applied to packed byte data. 

In Fig. 42, there is illustrated the process utilized by the result 
organizer 249 for transferral of data in an output internal format 383 to an 
output external format 384. This process includes equivalent steps 424, 425, 
426 and 440 to the conversion pnjcess described in Fig. 32. Additionally, the 
process 450 includes the steps of component deselection 451. denormalization 
452, byte addressing 453 and write masking 454. The component 
deselection process 451, as illustrated in Fig. 43, is basically the inverse 
operation of the component selection process 427 of Fig. 37 and involves the 
discarding of unwanted data. For example, in Fig. 43, only 3 valid channels 
of the input arc taken and packed into data items 456. 

The denormalization process 452 is illustrated with reference to Fig. 
44 and is loosely the inverse operation of the packed normalization process 
421 of Fig. 34. The denormalization process involves the translation of each 
object or data item, previously treated as a byte, to a non-byte value. 
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The byte addressing process 453 of Fig. 42 deals with any byte wize 
reorganization that is necessary to deal with byte addressing issues . For an 
externally unpacked byte output stream, the least two significant bits of the 
stream's address correspond to the active stream. The byte addressing step 
453 is responsible for re-mapping the output stream from one byte channel to 
another when external unpacked bytes are utilized (Fig. 45). Where an 
externally packed stream is utilized (Fig. 46), the byte addressing module 453 
remaps the start address of the output stream as illustrated. 

The write masks process 454 of Fig. 42 is illustrated in Fig. 47 and is 
used to mask off a particular channel eg. 460 of a packed stream which is not 
to be written out. 

The details of the input and output data type conversion to be applied 
are specified by the contents of the corresponding Data Manipulation 
Registers: 

• The Pixel Organizer Data Manipulation Register (po_drar) 

• The Operand Organizer B and Operand Organizer C Data 
Manipulation Registers (oob_dmr, ooc_dmr); 

• The Result Organizer Date Manipulation Register (ro.dmr); 
Each of the Data Manipulation Registers can be set up for an 

instruction in one of two ways: 

1. They can be explicitly set using any of the standard methods 
for writing to the co-processor's registers immediately prior to the execution of 
the instruction; or 

2. They can be set up by the oo-processor itself to reflect a current 
instruction. 

During the instruction decoding process, the co-processor examines the 
contents of the Instruction Word and the Data Word of the instruction to 
determine, amongst other things, how to set up the various Data 
Manipulation Registers. Not all combinations of the instruction and 
operands make sense. Several instructions have implied formats for some 
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operands. Instructions that are coded with inconsistent operands may 
complete without error, although any data so generated is "undefined". If the 
•S'bit of the corresponding Data Descriptor is 0, the co-processor sets the Data 
Manipulation Register to reflect the current instruction. 

The format of the Data Manipulation Registers is illustrated in Fig. 
48. The following table sets out the format of the various bits within the 
registers as illustrated in Fig. 48: 



Table 8: Data Manipulation Register Format 



Field 


Description 


ls3 


Lane Swap for byte 3 (most significant bvte) 


ls2 


Lane swap for byte 2 


lsl 


Lane swap for byte 1 


IsO 


Lane swap for byte 0 


suben 


Substitution Enables 

1 = substitute data from Internal Data Register for this 
byte 

0 = do not substitute data from Internal Data Register for 
this byte 


replicate 


Replication Count 

Indicates the number of additional data items to 
generate. 


wrmask 


Write Masks 

0 = write out corresponding byte channel 

1 = do not write out corresponding byte channel 


cmsb 


Choose most significant bits 

0 = choose least significant bits of a byte when performing 
denormalization (useful for halftoning operations) 

1 = choose most significant bits of a byte when performing 
denormalization (useful as inverse of input 
normalization) 
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Field 1 Description 


normalize 


Normalization factor: represents the number of bits Lo 
be translated to abj^te: 

0 = 1 bit data objects 

1 ~ O bit data oWpnfa 

2 = 4 bit data objects 

3 = 8 bit data objects 
4=16 bit data obiects 


bo 


Bit Offset: represents the starting bit address for 
objects smaller than a byte. Bit addressing is big 
ondian. 


P 


External Format: 

0 = unpacked bytes 

1 = packed stream 


if 


Internal Format: 

0 = pixels 

1 = unpacked bytes 

2 = packed bytes 

3 = other 


cc 


Channel count: 

For the Input Organizers this defines the number of 
normalized input bytes collected to form each internal 
data word during component selection. For the Output 
Organizer this defines the number of valid Twtoa f™™ +"u rt 
internal data word that will be sued to construct output 
data. 

0 = 4 active channels 

1 = 1 active channnls 

2 = 2 active channels 

3 = 3 active channels 


L 


! immediate data: 

0 = not long: immediate data f 

1 = long: pointer to data 
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Field 


Description 


what 
1 


addressing mode: 

v xiioti uubiuii bpecuic mode 

1 = sequential addressing 

2 = tile addressing 

3 = constant data, ie, one item of internal data is 
produced, and this item is used repetitively. 



A plurality of internal and external data types may be utilized with 
each instruction. All operand, results and instruction type combinations are 
potentially valid, although typically only a subset of those combinations will 
lead to meaningful results. Particular operand and result data types that 
are expected for each instruction are detailed below in a first table (Table 9) 
summarising the expected data types for external and internal formats: 
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Table 9: Expected Data Types 



ma truction 


Operand A 

(Pixel 
Organizer) 


Operand B 
(Operand 
Organizer B) 


Operand C 
(Operand 
Organizer C) 


Result 
(Result 
Organizer) 


Compositing 


ps 


px 


ps 


px(T) 
bl(B) 


ps 
ub 

const 


ub 


px 
ub 


ps 
ub 


GCSC 


ps 
ift 


ift 


mcsc 

scsc 

(B) 


mcsc 

scsc 

(B) 


mcsc 

scsc 

(B) 


mcsc 

scsc 

(B) 






JPEG comp. 


ps 
us 


Pb 


et 
(B) 


et(B) 


et(B) 


et (B) 


ub 


ps 


JPEG decomp 


ps 


pb 


fdt 
sdt 
<B) 


fdt 
sdt 
(B) 


fdt 

sdt (B) 


frit 

sdt 
(B) 


nVi 
P D 


ps 
ub 


Data coding 


PS 

ub 


px 
Pb 
ub 


et 
fdt 
sdt 
(B) 


et 

fdt 

sdt 

(B) 


et 
fdt 

sdt (B) 


et 
fdt 
sdt 
(B) 


px 
Pb 
ub 


ps 
ub 


J- X QllOJLV A. 11 1 C* UJ. 

ons and 
Convolutions 


skd 
lkd 


skd 
lkd 


it 
(B) 


it(B) 


it(B) 


it(B) 


px 


ps 
ub 


Matrix 

Multiplicatio 

n 


p3 

ub 


px 


mm 
(B) 


mm 
(B) 


mm 
(B) 


mm( 
B) 


px 


ps 
ub 



-354- 



81 



Instruction 


Operand A 

(Pixel 
Organizer) 


Operand B 
(Operand 
Organizer B) 


Operand C 
(Operand 
Organizer C) 


Result 
(Result 
Organizer) 


Halftoning 


ps 
ub 


px 
pb 
ub 


ps 
ub 


px 
pb 
ub 






px 
Pb 

UD 


PS 

ub 


Hierarchial 
Image: 
horizontal 
interpolation 


ps 
ub 


px 
pb 
ub 


- 




- 


- 


px 

pb 

lib 

U.U 


ps 
ub 


Hierarchial 
Image: vertical 
interpolation 
and residual 
merging 


ps 
ub 


px 
pb 
ub 


ps 
ub 


px 
pb 
ub 


- 




px 
pb 
ub 


ps 
ub 


General Memory 
L»opy 


ps 
ub 


px 
Pb 
ub 










px 
pb 
ub 


ps 

Ub i 


Peripheral DMA 


















Internal Access 


















Flow Control 



















The symbols utilized in the above table are as follows: 
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Table 10: Symbol Explanation 



Symbol 


Explanation 


ps 


packed stream 


Pb 


packed bytes 


ub 


unpacked bytes 


px 


pixels 


bl 


blend 


coast 


constant 


mcsc 


4 output channel 


scsc 


1 output channel color conversion table 


ift 


Interval and Fraction tables 


et 


JPEG encoding table 


fdt 


fast JPEG decoding table 


sdt 


slow JPEG decoding table 


skd 


short kernel descriptor 


lkd 


long kernel descriptor 


mm 


matrix co-efficient table 




Symbol 


Explanation { 


it 


image table 1 


(B) 


this organizer in bypass mode for this operation 


(T) 


operand may tile 




no data flows via this operand | 



3.16 Data Normalization Circuit 

Referring to Fig. 49, there is shown a computer graphics processor 
having three main functional blocks: a data normalizer 1062 which may be 
implemented in each of the pixel organizer 246 and operand organizers B and 
C 247, 248, a central graphics engine in the form of the main data path 242 or 
JPEG units 241 and a programming agent 1064, in the form of an instruction 
controller 235. The operation of the data normalizer 1062 and the central 
graphics engine 1064 is determined by an instruction stream 1066 that is 
provided to the programming agent 1064. For each instruction, the 
programming agent 1064 performs a decoding function and outputs internal 
control signals 1067 and 1068 to the other blocks in the system. For each 
input data word 1069, the normalizer 1062 will format the data according to 
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the current instruction and pass the result to the central graphics engine 1063, 
where further processing is performed. 

The data normalizer represents, in a simplified form, the pixel 
organizer and the operand organizers B and C. Each of these organizers 
implements the data normalization circuitry, thereby enabling appropriate 
normalization of the input data prior to it passing to the central graphics 
engine in the form of the JPEG coder or the main data path. 

The central graphics engine 1063 operates on data that is in a 
standard format, which in this case is 32-bit pixels. The normalizer is thus 
responsible for converting its input data to a 32-bit pixel format. The input 
data words 1069 to the normalizer are also 32 bits wide, but may take the 
form of either packed components or unpacked bytes. A packed component 
input stream consists of consecutive data objects within a data word, the data 
objects being 1,2,4,8 or 16 bits wide. By contrast, an unpacked byte input 
stream consists of 32-bit words of which only one 8-bit byte is valid. 
Furthermore, the pixel data 11 produced by the normalizer may consist of 
1,2,3 or 4 valid channels, where a channel is denned as being 8 bits wide. 

Turning now to Fig. 50, there is illustrated in greater detail a 
particular hardware implementation of the data normalizer 1062. The data 
normalization unit 1062 is composed of the following circuits: a First- In- First- 
Out buffer (FIFO) 1073, a 32-bit input register (REGl) 1074, a 32-bit output 
register (REG2) 1076, normalization multiplexors 1075 and a control unit 
1076. Each input data word 1069 is stored in the FIFO 1073 and is 
subsequently latched into REGl 1074, where it remains until all its input 
bits have been converted into the desired output format. The normalization 
multiplexors 1075 consist of 32 combinatorial switches that produce pixels to 
be latched into REG2 by selecting bits from the value in REGl 1074 and the 
current output of the FIFO 1073. Thus the normalization multiplexors 1075 
receive two 32-bit input words 1077, 1078, denoted as x[63..32] and x[31..0]. 
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It has been found that such a method improves the overall throughput 
of the apparatus, especially when the FIFO contains at least two valid data 
words during the course of an instruction. This is typically due to the way in 
which data words originally fetched from memory. In some cases, a desired 
data word or object may be spread across or "wrapped" into a pair of adjacent 
input data words in the FIFO buffer. By using an additional input register 
1074, the normalization multiplexers can reassemble a oomplete input data 
word using components from adjacent data words in the FIFO buffer, thereby 
avoiding need for additional storage or bit-stripping operations prior to the 
main data manipulation stages. This arrangement is particularly 
advantageous where multiple data words of a similar type are inputted to the 
normalizes 

The control unit generates enable signals REG1J2N 20 and 
REG2_EN[3..0] 1081 for updating REG1 1074 and REG2 1076. respectively, 
as well as signals to control the FIFO 1073 and normalization multiplexors 
1075. 

The programming agent 1064 in Fig. 49 provides the following 
configuration signals for the data normalizer 1062: a FIFO_WR 4 signal, a 
normalization factor n[2..0], a bit offset b[2..0], a channel count c(1..0] and an 
external format (E). Input data is written into the FIFO 1073 by asserting the 
FIFO_WR signal 1085 for each clock cycle that valid data is present. The 
FIFO asserts a fifojull status flag 1086 when there is no space available. 
Given 82-bit input data, the external format signal is used to determine 
whether the input is in the format of a packed stream (when E=l) or consists 
of unpacked bytes (when E=0). For the case when E=l, the normalization 
factor encodes the 6ize of each component of a packed stream, namely: n=0 
denotes 1-bit wide components, n=l denotes 2 bits per component, n=2 
denotes 4 bits per component, n=3 denotes 8-bit wide components and n>3 
denotes 16-bit wide components. The channel count encodes the maximum 
number of consecutive input objects to format per clock cycle in order to 
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produce pixels with the desired number of valid bytes. In particular, c=l yields 
pixels with only the least significant byte valid, c=2 denotes least significant 2 
bytes valid, c=3 denotes least signiEcant 3 bytes valid and c=0 denotes all 4 
bytes valid. 

When a packed stream consists of components that are less than 8 
bits wide, the bit offset determines the position in x[31..0], the value stored in 
REG1, from which to begin processing data. Assuming a bit offset relative to 
the most significant bit of the first input byte, the method for producing an 
output data byte yI7..0] is described by the following set of equations :- 

where n = 0 : 

y[i] = x[7-b], where 0 <=i<s=7 
where n = 1 : 

y[i] = x[7-b], where i= 1,3,5,7 

y[i] = x[6-b], where i = 0,2,4,6 

where n = 2 : 

y[3] = x[7-b] 

y[2] = x[6-b] 

y[l] = x[5-b] 

y[0] = x[4-b] 

y[7] = y[3] 

y[6] = y[2] 

y[5] = y[i] 

y[4] = y[0] 
where n = 3 : 

y[i] = x[i], where 0 <=i<=7 
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where n > 3 : 
y[7..0J = xIlo..8J 

Corresponding equations may be used to generate output data bytes 
y[16..8], y[23..16J and y[31..24]. 

The above method may be generalized to produce an output array of 
any length by taking each component of the input stream and replicating it as 
many times as necessary to generate output objects of standard width. In 
addition, the order of processing each input component may be defined as 
little-endian or big-endian. The above example deals with big-endian 
component ordering since processing always begins from the most significant 
bit of an input byte. Little-endian ordering requires redefinition of the bit 
offset to be relative to the least significant bit of an input byte. In situations 
where the input component width exceeds the standard output width, output 
components are generated by truncating each input component, typically by 
removing a suitable number of the least significant bits, hi the above set of 
equations, truncation of 16-bit input components to form 8-bit wide standard 
output is performed by selecting the most significant byte of each 16-bit data 
object. 

The control unit of Fig. 60 performs the decoding of n[2..0] and c[1..0], 
and uses the result along withb[2..0] and E to provide the select signals for 
the normalization multiplexors and the enable signals for REG1 and REG2. 
Since the FIFO may become empty during the course of an instruction, the 
control unit also contains counters that record the current bit position, 
in_bit[4..0], in REGl from which to select input data, and the currant byte, 
out_byte[1..0] ; in REG2 to begin writing output data. The control unit 
detects when it has completed processing each input word by comparing the 
value of in_bit[4..0] to the position of the final object in REGl, and initiates a 
FIFO read operation by asserting the FIFO.RD signal for one clock cycle when 
the FIFO is not empty. The signals fifo.empty and fifo_full denote the FIFO 
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status flags, such that fifc_empty=l when the FIFO contains no valid data, 
and fifo_fuU=l when the FIFO is full. In the same clock cycle that FIFO_RD is 
asserted, REG1_EN is asserted so that new data are captured into REGl. 
There are 4 enable signals for REG2, one for each byte in the output register. 
The control unit calculates REG2_EN[3..0] by taking the minimum of the 
following 3 values : the decoded version of c[1..0], the number of valid 
components remaining to be processed in REGl, and the number of unused 
channels in REG2. When E=0 there is only one valid component in REGl. A 
complete output word is available when the number of channels that have 
been filled in REG2 is equal to the decoded version of c[1..0]. 

In a particularly preferred embodiment of the invention, the circuit 
area occupied by the apparatus in Fig. 50 can be substantially reduced by 
applying a truncation function to the bit offset parameter, such that only a 
restricted set of offsets are used by the control unit and normalization 
multiplexors. The offset truncation depends upon the normalization factor 
and operates according to the following equation: 
b_trunc[2..0] =0, where n >= 3 

= b[2..0], where n = 0 

= b[2..1]&"0", where n=l 

= b[2]&"00", where n = 2 
(Note that "&" denotes bitwize concatenation). 

The. above method allows each of the normalization multiplexors, 
denoted in Fig. 50 byMUXO, MUXl ... MUX31, to be reduced from 32-to-l in 
size when no truncation is applied, to be a maximum size of 20-to-l with bit 
offset truncation. The size reduction in turn leads to an improvement in circuit 
speed. 

It can be seen from the foregoing that the preferred embodiment 
provides an efficient circuit for the transformation of data into one of a few 
normalized forms. 
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3.17 Image Processing Operations of Accelerator Card 

Returning again to Fig. 2 and Table 2, as noted previously, the 
instruction controller 235 "executes" instructions which result in actions being 
performed by the co-proceseor 224. The instructions executed include a 
number of instructions for the performance of useful functions by the main 
data path unit 242. A first of these useful instructions is compositing. 
3.17.1 Compositing 

Referring now to Fig. 51, there is illustrated the compositing model 
implemented by the main data path unit 242. The compositing model 462 
generally has three input sources of data and the output data or sink 463. 
The input sources can firstly include pixel data 464 from the same destination 
within the memory as the output 463 is to be written to. The instruction 
operands 465 can be utilized as a data source which includes the color and 
opacity information. The color and opacity can be either flat, a blend, pixels 
or tiled. The flat or blend is generated by the blend generator 467, as it is 
quicker to generate them internally than to fetch via input/output. 
Additionally, the input data can include attenuation data 466 which 
attenuates the operand data 465. The attenuation can be flat., bit map or a 
byte map. 

As noted previously, pixel data normally consists of four channels with 
each channel being one byte wide. The opacity channel is considered to be the 
byte of highest address. For an introduction to the operation and usefulness 
of compositing operations, reference is made to the standard texts including 
the seminal paper by Thomas Porter and Tom Duff "Compositing Digital 
Images" in Computer Graphics Volume 18, Number 3, July 1984. 

The co-processor can utilize pre-multiplied data. Pre- multiplication 
can consist of pre-multiplying each of the colored channels by the opacity 
channel. Hence, two optional pre-multiplication units 468, 469 are provided 
for pre-multiplying the opacity channel 470, 471 by the colored data to form, 
where required, pre-multiplied outputs 472, 473. A compositing unit 475 
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implements a composite of its two inputs in accordance with the current 
instruction data. The compositing operators are illustrated in Table 11 
below: 



Table 11: Compositing Operations 



Operator 


Definition 


( a co' a o) over 0>co> h o) 


( a oo +b co< 1 - a o)» a o +b o( 1 - a o)) i 


< a co> a o> m ftco'V 




( a co a o) out Q>eo h o> 


<a 0 o(l-b 0 ).a 0 (l-b 0 )) 


( a co> a o> at °P (tco> b o) 


( a oo b o +b co( 1 - a o)» b o) 




Operator 


] Definition 
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(a co (]-b 0 )+b co (l-a 0 ) ) a 0 (l-b 0 )+b 0 (l-a 0 )r" 


( a co- a o)P lus ( b co' b o> 


(wc(a co +b C0 .r(a o +b o - 




255)/255)4r(clamp(a 0 -«-b 0 )- 




255)/255,clamp(a 0 +b t) )) 


( a co.a 0 ) loadzero (b co ,b 0 ) 


(0,0) 


(^^o) loadc (boo- b o) 


Ooo' a o) 


( a co> a o> loado 


( a co- b o) 


(a^a^ loadco (b co ,b^) 





The nomenclature (a^, refers to a pre-multiplied pixel of color a 

c 

and opacity a Q . R is an offset value and "wc" is a wrapping/clamping 
operator whose operation is explained below. It should be noted that the 
reverse operation of each operator in the above table is also implemented by a 
composting unit 475. 

A clamp/wrapping unit 476 is provided to clamp or wrap data around 
the limit values 0-255. Further, the data can be subjected to an optional 
"unpre-multiplication M 477 restoring the original pixel values as required. 
Finally, output data 463 is produced for return to the memory. 

In Fig. 52, there is illustrated the form of an instruction word directed 
to the main data path unit for composting operations. When the X field in 
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the major op-code is 1, this indicates a plus operator is to be applied in 
accordance with the aforementioned table. When this field is 0, another 
instruction apart from the plus operator is to be applied. The P a field 
determines whether or not to pre-rnultiply the first data stream 464 (Fig. 51). 
Tho P b field determines whether or not to pre-rnultiply the second data 
stream 465. The P r field determines whether or not to "unpre-multiply" the 
result utilising unit 477. The C field determines whether to wrap or clamp, 
overflow or underflow in the range 0-255. The n oom-code" field determines 
which operator is to be applied. The plus operator optionally utilizes an 
offset register (mdp_por). This offset is subtracted from the result of the 
plus operation before wrapping or clamping is applied. For plus operators, 
the corn-code field is interpreted as a per channel enablement of the offset 
register. 

The standard instruction word encoding 280 of Fig. 10 previously 
discussed is altered for composting operands. As the output data 
destination is the same as the source, operand A will always be the same 
operand as the result word so operand A can be utilized in conjunction with 
operand B to describe at greater length the operand B. As with other 
instructions, the A descriptor within the instructions still describes the 
format of the input and the R descriptor defines the format of the output. 

Turning now to Fig. 53, there is illustrated in a first example 470, the 
instruction word format of a blend instruction. A blend is defined to have a 
start 471 and end value 472 for each channel. Similarly, in Fig. 54 there is 
illustrated 475 the format of a tile instruction which is defined by a tile 
address 476 a start offset 477, a length 478. All tile addressee and 
dimensions are specified in bytes. Tiling is applied in a modular foahion and, 
in Fig. 55, there is shown the interpretation of the fields 476 - 478 of Fig. 54. 
The tile address 476 denotes the start address in memory of the tile. A tile 
start offset 477 designates the first byte to be utilized as a start of the tile. 
The tile length 478 designates the total length of the tile for wrap around. 
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Returning to Fig. 51, every color component and opacity can be 
attenuated by an attenuation value 466. The attenuation value can be 
supplied in one of three ways: 

1. Software can specify a flat attenuation by placing the 
attenuation factor in the operand C word of the instruction. 

2. A bit map attenuation where 1 means fully on and 0 means 
fully off can be utilized with software specifying the address of the bit map in 
the operand C word of the instruction. 

3. Alternatively, a byte map attenuation can be provided again 
with the address of the byte map in operand C. 

Since the attenuation is interpreted as an unsigned integer from 0- 
265, the pre -multiplied color channel is multiplied by the attenuation factor 
by effectively calculating: 

C oa = C oa xA/255 

Where A is the attenuation and C Q is the pre-multiplied color channel. 
3.17.2 Color Space Conversion Instructions 

Returning again to Fig. 2 and Table 2. the main data path unit 242 
and data cache 230 are also primarily responsible for color conversion. The 
color space conversion involves the conversion of a pixel stream in a first color 
space format, for example suitable for RGB color display, to a second color 
space format, for example suitable for CYM or CYMK printing. The color 
space conversion is designed to work for all color spaces and can be used for 
any function from at least one to one or more dimensions. 

The instruction controller 235 configures, via the Cbus 231, the main 
data path unit 242, the data cache controller 240, the input interface switch 
252, the pixel organizer 246, the MUV buffer 250, the operand organizer B 
247, the operand organizer C 248 and the result organizer 249 to operate in 
the color conversion mode. In this mode, an input image consisting of a 
plurality of lines of pixels is supplied, one line of pixels after another, to the 
main data path unit 242 ae a stream of pixels. The main data path unit 242 
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(Fig. 2) receives the stream of pixels from, the input interface switch 252 via 
the pixel organizer 246 for color space conversion processing one pixel at a 
time, hi addition, interval and fractional tables are pre-loaded into the 
MUV buffer 250 and color conversion tables are loaded into the data cache 
230. The main data path unit. 242 accesses these tables via the operand 
organizers B and C, and converts these pixels, for example from the RGB color 
space to the CYM or CTYMK color space and supplies the converted pixels to 
the result organizer 249. The main data path unit 242, the data cache 230, 
the data controller 240 and the other abovementioned devices are able to 
operate in either of the following two modes under control of the instruction 
controller 235; a Single Output General Color Space (SOGCS) Conversion 
mode or a Multiple Output General Color Space (MOGCS) Conversion Mode. 
For more details on the data cache controller 240 and data cache 230, 
reference is made to the section entitled Data Cache Controller and Cache 
240, 230 (Fig. 2). 

Accurate color space conversion can be a highly non-linear process. 
For example, color space conversion of a RGB pixel to a single primary color 
component (e.g. cyan) of the CYMK color space is theoretically linear, however 
in practice non-linearities are introduced typically by the output device which 
is used to display the colour components of the pixel. Similarly for the color 
space conversion of the RGB pixel to the other primary color components 
(yellow, magenta or black) of the CYMK color space. Consequently a non- 
linear colour space conversion is typically used to compensate for the non- 
linearities introduced on each colour component. The highly non-linear 
nature of the color conversion process requires either a complex transfer 
function to be implemented or a look-up table to be utilized. Given an input 
color space of, for example, 24 bit RGB pixels, a look-up table mapping each of 
these pixels to a single 8 bit primary color component of the CYMK color space 
(i.e. cyan) would require over 16 megabytes. Similarly, a look-up table 
simultaneously mapping the 24 bit RGB pixels to all four 8 bit primary color 
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components of the CYMK color space would require over 64 megabytes, which 
is obviously excessive. Instead, the main data path 242 (Fig. 2) uses a look- 
up table stored in the data cache 230 having sparsely located output oolor 
values corresponding to points in the input color space and interpolates 
between the output color values to obtain an intermediate output, 
a. Single Output General Color Space (SOGCS) Conversion Mode 
In both the single and multiple output color conversion modes 
(SOGCS) and (MOGCS), the RGB color space is comprized of 24 bit pixels 
having 8 bit red, green and blue color components. Each of the RGB 
dimensions of the RGB color space is divided into 15 intervals with the length 
of each interval having a substantially inverse proportionality to the non- 
linear behavior of the transfer function between the RGB to CYMK color space 
of the printer. That is, where the transfer function has a highly non-linear 
behavior the interval size is reduced and where the transfer function has a 
more linear behavior, the size of the interval is increased. Preferably, the 
color space of each output printer is accurately measured to determine those 
non-linear portions of its transfer function. However, the transfer function 
can be approximated or modelled based on know-how or measured 
characteristics of a type printer (e.g.: ink-jet). For each color channel of an 
input pixel, the color component value defines a position within one of the 15 
intervals. Two tables are used by the main data path unit 2d2 to determine 
which interval a particular input color component value lies within and also to 
determine a fraction along the interval in which a particular input color 
component value lies. Of course, different tables may be used for output 
printers having different transfer functions. 

As noted previously, each of the RGB dimensions is divided into 15 
intervals. In this way the RGB color space forms a 3-dimensional lattice of 
intervals and the input pixels at the ends of the intervals form sparsely 
located points in the input color space. Further, only the output color values 
of the output color space corresponding to the endpoints of the intervals are 
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stored in look-up tables. Hence, an output color value of an input color pixel 
can be calculated by determining the output color values corresponding to the 
endpoints of the intervals within which the input pixel lies and interpolating 
such output color values utilising the fractional values. This technique 
reduces the need for large memory storage. 

Turning now to Fig. 56, there is illustrated 480 an example of 
determining for a particular input RGB color pixel, the corresponding interval 
and fractional values. The conversion process relies upon the utilization of 
an interval table 482 and a fractional table 483 for each 8 bit input color 
channel of the 24 bit input pixel. The 8 bit input color component 48 1 , shown 
in a binary form in Fig. 56 having the example decimal number 4, is utilized 
as a look-up to each of the interval and fractional tables. Hence, the number 
of entries in each table is 256. The interval table 482 provides a 4 bit output 
defining one of the intervals numbered 0 to 14 into which the input color 
component value 481 falls. Similarly, the fractional table 483 indicates the 
fraction within an interval that the input color value component 481 falls. 
The fractional table stores 8 bit values in the range of 0 to 265 which are 
interpreted as a fraction of 256. Hence, for an input color value component 
48 1 having a binary equivalent to the decimal value 4, this value is utilized to 
look-up the interval table 482 to produce an output value of 0. The input, 
value 4 is also utilized to look-up the fractional table 483 to produce an 
output value of 160 which designates the fraction 160/256. As can be seen 
from the interval and fractional tables 482 and 483, the interval lengths are 
not equal. As noted previously, the length of the intervals are chosen 
according to the non-linear behavior of the transfer function. 

As mentioned above, the separate interval and fractional tables are 
utilized for each of the RGB color components resulting in three interval 
outputs and three fractional outputs. Each of the interval and fractional 
tables for each color component are loaded in the MUV buffer 250 (Fig. 2) and 
accessed by the main data path unit 242 when required. The arrangement of 
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the MUV buffer 250 for the color conversion process is as shown in Fig. 57. 
The MUV buffer 250 (Fig. 57) is divided into three areas 488, 489 and 490, 
one area for each color component. Each area e.g. 488 is further divided into 
a 4 bit interval table and a 8 bit fractional table. A 12 bit output 492 is 
retrieved by the main data path unit 242 from the MUV buffer 250 for each 
input color channel. In the example given above of a single input color 
component having a decimal value 4, the 12 bit output will be 000001010000. 

Turning now to Fig. 58, there is illustrated an example of the 
interpolation process. The interpolation process consists primarily of 
interpolation from one three dimensional space 500, for example RGB color 
space to an alternative color space, for example CMY or CMYK The pixels 
P0 to P7 form sparsely located points in the RGB input color space and having 
corresponding output color values CV(P0) to CV(P7) in the output color space. 
The output color component value corresponding to the input pixel Pi falling 
between the pixels P0 to P7 is determined by; firstly, determining the 
endpoints PO, P1,...,P7 of the intervals surrounding the input pixel Pi; 
secondly, determining the fractional components frac_r, frac_gand fracjb; and 
lastly interpolating between the output color values CV(P0) to CV(P7) 
corresponding to the endpoints PO to P7 using the fractional components. 

The interpolation process includes a one dimensional interpolation in 
the red (R) direction to calculate the valueB temp 11, temp 12, temp 13, temp 
14 in accordance with the following equations: 

temp 11 = CV(P0) + frac_r (CV(P1) - CV(PO)) 
temp 12 = CV(P2) + frac_r (CV(P3) - CV(P2)) 
temp 13 = CV(P4) + fracr (CV(P5) - CV(P4)) 
temp 14 = CV(P6) + fracjr (CV(P7) - CV(P6)) 
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Next, the interpolation process includes the calculation of a further one 
dimensional interpolation in the green (G) direction utilising the following 
equations to calculate the values temp 21 and temp 22: 

temp 21 = temp 11 + frac_g (temp 12 - temp 11) 
temp 22 = temp 13 + frac_g (temp 14 - temp 13) 

Finally, the final dimension interpolation in the blue (B) direction is 
carried out to calculate a final color output value in accordance with the 
following equation. 

final = temp 21 + fragjb (temp 22 - temp 21) 

Unfortunately, it is often the case that the input and output gamut 
may not match. In this respect, the output gamut may be more restricted 
that the input gamut and in this case, it is often neoessaiy to clamp the 
gamut at the extremes. This often produces unwanted artefacts when 
converting using the boundary gamut colors. An example of how this problem 
can occur will now be explained with reference to Fig. 59, which represents a 
one dimensional mapping of input gamut values to output gamut values. It 
is assumed that output values are defined for the input values at points 510 
and 511. However, if the greatest output value is clamped at the point 512 
then the point 511 must have an output value of this magnitude. Hence, 
when interpolating between the two points 510 and 511, the line 515 forms 
the interpolation line and the input point 516 produces a corresponding 
output value 517. However, this may not be the best color mapping, 
especially where, without the gamut limitations, the output value would have 
been at the point 518. The interpolation line between 510 and 518 would 
produce an output value of 519 for the input point 516. The difference 
between the two output values 517 and 519 can often lead to unsightly 
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artefacts, particularly when printing edge of gamut colors. To overcome this 
problem, the main data path unit can optionally calculate in an expanded 
output color space and then scale and clamp to the appropriate range utilising 
the following formula: 

0 if x<63 
out= 2(x-64)if (64<x<191) (4) 

265 if (192<x) 

Returning now to Fig. 58, it will be evident that the interpolation 
process can either be carried out in the SOCGS conversion mode which 
converts RGB pixels to a single output color component (for example, cyan) or 
the MOGCS mode which converts RGB pixels to all the output color 
components simultaneously. Where color conversion is to be carried out for 
each pixel in an image, many millions of pixels may have to be independently 
color converted. Hence, in order for high speed operation, it is desirable to be 
able to rapidly locate the 8 values (P0-P7) around a particular input value. 

As noted previously with respect to Fig. 57, the main data path unit 
242 retrieves for each color input channel, a 12 bit output consisting of a 4 bit 
interval part and a 8 bit fractional part. The main data path unit 242 
concatenates thesG 4 bit interval parts of the red, green and blue color 
channels to form a single 12 bit address (Ir, Iq, Ijj), as shown in Fig. 60 as 
520. 

Fig. 60 shows a data flow diagram illustrating the manner in which a 
single output color component 563 is obtained in response to the single 12 bit 
address 520. The 12 bit address 520 is first fed to an address generator of 
the data cache controller 240, such as the generator 1881 (shown in Fig. 141) 
which generates 8 different 9 bit line and byte addresses 521 for memory 
banks (Bq, Bi„JB7). The data cache 230 (Fig, 2) is divided into 8 
independent memory banks 522 which can be independently addressed by the 
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respective 8 lino and byte addresses. The 12 bit address 520 is mapped by 
the address generator into the 8 line and byte addresses in accordance with 
the following table: 



Table 12: Address Composition for SOGCS Mode 
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where BIT18:6], BIT[5:3] and BIT[2:0J represent the sixth to eighth 
bits, the third to fifth bits and the zero to second bits of the 9 bit bank 
addressee respectively; and 

where R[3:l], G[3:l] and B[3:l] represent the first to third bits of the 4 
bit intervals Ir, 1q and Ig of the 12 bit address 520 respectively. 

Reference is made to memory bank 5 of Tabic 12 for a more detailed 
explanation of the 12 bit to 9 bit mapping. In this particular case, the bits 1 
to 3 of the 4 bit red interval I r of the 12 bit address 520 are mapped to bits 6 
to 8 of the 9 bit address B5; bits 1 to 3 and bit 0 of the 4 bit green interval Ig 
are summed and then mapped to bits 3 to 5 of the 9 bit address B5; and bits 1 
to 3 of the 4 bit blue interval Ifo arc mapped to bits 0 to 2 of the 9 bit address 
B5. 

Each of the 8 different line and byte addresses 521 is utilized to 
address a respective memory bank 522 which consists of 512 x 8 bit entries, 
and the corresponding 8 bit output color component 523 is latched for each of 
the memory banks 522. As a consequence of this addressing method, the 
output color values of CV(PO) to CV(P7) corresoponding to the endpoints PO to 
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P7 maybe located at different positions in the memory banks. For example, 
a 12 bit address of 0000 0000 0000 will result in the same bank address for 
each bank, ie 000 000 000. However a 12 bit address of 0000 0000 0001 will 
result in different bank addresses, ie a bank address of 000 000 000 for banks 
7, 5, 3 and 1 and a bank address of 000 000 001 for banks 6, 4, 2 andO. It is 
in this way the eight single output color values CV(PO)-CV(P7) surrounding a 
particular input pixel value are simultaneously retrieved from respective 
memory banks and duplication of output color values in the memory banks 
can be avoided 

Turning now to Fig. 61, there is illustrated the structure of a single 
memory bank of the data cache 230 when utilized in the single color 
conversion mode. Each memory bank consists of 128 line entries 531 which 
are 32 bits long and comprize 4x8 bit memories 533 - 636. The top 7 bits of 
the memory address 521 are utilized to determine the corresponding row of 
data within the memory address to latch 542 as the memory bank output. 
The bottom two bits are a byte address and are utilized as an input to 
multiplexer 543 to determine which of the 4x8 bit entries should be chosen 
544 for output. One data item is output for each of the 8 memory banks per 
clock cycle for return to the main data path unit 242. Hence, the data cache 
controller receives a 12 bit byte address from the operand organizer 248 (Pig. 
2) and outputs in return to the operand organizers 247, 248, the 8 output color 
values for interpolation calculation by tho main data path unit 242. 

Returning to Fig. 60, the interpolation equations are implemented by 
the main data path unit 242 (Fig. 2) in three stages. In the main data path 
unit, a first stage of multiplier and adder units eg. 550 which take as input 
the relevant color values output by the corresponding memory banks eg. 522 in 
addition to the red fractional component 551 and calculate the 4 output 
values in accordance with stage 1 of the abovementioned equations. The 
outputs eg. 553, 554 of this stage are fed to a next stage unit 556 which 
utilizes the frac_g input 557 to calculate an output 558 in accordance with the 
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aforementioned equation for stage 2 of the interpolation process. Finally, the 
output 558 in addition to other outputs eg. 559 of this stage ore utilized 560 
in addition to the frao_h input 562 to calculate a final output color 563 in 
accordance with the aforementioned equations. 

The process illustrated in Fig. 60 is implemented in a pipelined 
manner so as to ensure maximum overall throughput. Further, the method 
of Fig. 60 is utilized when a single output color component 563 is required. 
For example, the method of Fig. 60 can be utilized to first produce the cyan 
color components of an output image followed by the magenta, yellow and 
black components of an output image reloading the cache tables between 
passes. This is particularly suitable for a four-pass printing process which 
requires each of the output colors as part of separate pass, 
b. Multiple Output General Color Space Mode 

The co-processor 224 operates in the MOGCS mode in a substantially 
similar manner to the SOCGS mode, with a number of notable exceptions. In 
the MOGCS mode, the main data path unit 242, the data cache controller 240 
and data cache of Fig. 2 co-operate to produce multiple color outputs 
simultaneously with four primary colors components being output 
simultaneously. This would require the data cache 230 to be four times 
larger in size. However, in the MOGCS mode of operation, in order to save 
storage space, the data cache controller 240 stores only one quarter of all the 
output color values of the output color space. The remaining output color 
values of the output color space are stored in a low speed external memory 
and are retrieved as required. This particular apparatus and method is 
based upon the surprising revelation that the implementation of sparsely 
located color conversion tables in a cache system have an extremely low miss 
rate. This is based on the insight there is a low deviation in color values 
from one pixel to the next in most color images. In addition, there is a high 
probability the sparsely located output color values will be the same for 
neighboring pixels. 
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Turning now to Fig. 62 there will now be described the method carried 
out by the co-processor to implement multi-channel cached color conversion. 
Each input pixel is broken into its color components and a corresponding 
interval table value (Fig. 56) is determined as previously described resulting 
in the three 4 bit intervals fr, Ig, lb denoted 570. The combined 12 bit 
number 570 is utilized in conjunction with the aforementioned table 12 to 
again derive eight 9-bit addresses. The addresses eg. 572 are then 
re-mapped as will be discussed below with reference to Fig. 63, and then are 
utilized to look up a corresponding memory bank 573 to produce four colour 
output channels 574. The memory bank 573 stores 128x32 bit entries out of 
a total possible 512x32 bit entries. The memory bank 573 forms part of the 
data cache 230 (Fig. 2) and is utilized as a cache as will now be described with 
reference to Fig. 63. 

Turning to Fig. 63, the 9 bit bank input 578 is re-mapped as 579 so as 
to anti-alias memory patterns by re-ordering the bits 580-582 as illustrated. 
This reduces the likelihood of neighboring pixel values aliasing to the same 
cache elements. 

The reorganized memory address 579 is then utilized as an address 
into the corresponding memory bank eg. 585 which comprizes 128 entries each 
of 32 bits. The 7 bit line address is utilized to access the memory 585 
resulting in the corresponding output being latched 586 for each of the 
memory banks. Each memory bank, eg 585 has an associated tag memory 
which comprizes 128 entries each of 2 bite. The 7 bit line address is also 
utilized to access the corresponding tag in tag memory 587. The two most 
significant bits of the address 579 are compared with the corresponding Lag in 
tag momory 587 to determine if th© relevant output color value is Btored in the 
cache. These two most significant bits of the 9 bit address correspond to the 
most significant bits of the red and green data intervals (see Table 12). Thus 
in the MOGCS mode the RGB input color space is effectively divided into 
quadrants along the red and green dimensions where the two most significant 
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bits of the 9 bit address designates the quadrant of the RGB input color space. 
Hence the output color values are effectively divided into four quadrants each 
designated by a two bit tag. Consequently the output color values for each 
tag value for a particular line are highly spaced apart in the output color space, 
enabling anti-aligning of memory patterns. 

Where the two bit tags do not match a cache mi6s is recorded by the 
data cache controller and the corresponding required memory read is initiated 
by the data cache controller with the cache look up process being stalled until 
all values for that line corresponding to that two bit tag entry are read irom 
an external memory and stored in the cache. This involves the reading of the 
relevant line of the color conversion table stored in the external memory. The 
process 575 of Fig. 63 is carried out for each of the memory banks eg. 573 of 
Fig. 62 resulting, depending on the cache contents, in a time interval elapsing 
before the results eg. 586 are output from each corresponding memory bank. 
Each of the eight 32 bit sets of data 586 are then forwarded to the main data 
path unit (242) which carries out the aforementioned interpolation prooess 
(Fig. 62) in three stages 590 - 592 to each of the colored channels 
simultaneously and in a pipelined manner so as to produce four color outputs 
595 for sending to a printer device. 

Experiments have shown that the caching mechanism as described 
with reference to Figs. 62 and 63 can be advantageously utilized as typical 
images have a cache miss-rate on average requiring between 0.01 and 0.03 
cache line fetches per pixel. The utilization of the caching mechanism 
therefore leads to substantially reduced requirements, in the typical case, for 
memory accesses outside of the data cache. 

The instruction encoding for both color space conversion modes (Fig. 
10) utilized by the co-processor has the following structure: 
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Table 12A: Instruction Encoding for Color Space Conversion 



rz — - 

1 Operand 


Description 


Internal 
Format 


1 

External 

Format 


Operand A 


source pixels 


pixels 


packed stream 


Operand B 


multi output channel 
color conversion 
tables 


other 


multi channel esc 
tables 


Operand C 


Interval and Fraction 
Tables 




I&F table format 


Result i 


pixels 


pixels 


packed stream 




bytes 


unpacked bytes 


unpacked bytes, 
packed stream 



The instruction field encoding for color space conversion instruction is 
illustrated in Pig. 64 with the following minor opcode encoding for the color 
conversion instructions. 



Table 13: Minor Opcode Encoding for Color Conversion 



Instructions 



Field 


Description 


trans[3:0] 


0 = do not apply translation and clamping step to 
corresponding output value on this channel 


M 


0 = single channel color tabl© format 

1 = multi channel color table format | 



Fig. 65 shows a method of converting a stream of RGB pixels into 
CYMK color values according to the MOGCS mode. In step Si, a stream of 
24 bit RGB pixels are received by the pixel organiser 246 (Fig. 2). Instep S 2> 
the pixel organiser 246 determines the 4 bit interval values and the 8 bit 
fractional values of each input pixel from lookup tables, in the manner 
previously discussed with respect to Figs. 56 and 57. The interval and 
fractional values of the input pixel designate which intervals and fractions 
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along the intervals in which the input pixel lies. In step S 3 , the main data 
path unit 242 concatenates the 4 bit intervals of the red, green and blue color 
components of the input pixel to form a 12 bit address word and supplies this 
12 bit address word to the data cache controller 240 (Fig. 2). In step S 4 , the 
data cache controller 240 converts this 12 bit address word into 8 different 9 
bit addresses, in the manner previously discussed with respect to Table 12 
and Fig, 62. These 8 different addresses designate the location of the 8 
output color values CV(P0)-CV(P7) in the respective memory banks 573 (Fig. 
62) of the data cache 230 (Fig, 2). In step S 5 , the data cache controller 240 
(Fig. 2) remaps the 8 different 9 bit addresses in the manner described 
previously with respect to Fig. 63 . In this way, the most significant bit of the 
red and green 4 bit intervals are mapped to the two most significant bits of 
the 9 bit addresses . 

In step S 6 , the data cache controller 240 then compares the two most 
significant bits of the 9 bit addresses with respective 2 bit tags in memory 
587 (Fig. 63). If the 2 bit tag does not correspond to the two most significant 
bits of the 9 bit addresses, then the output color values CV(P0)-CV(P7) do not 
exist in the cache memory 230, Hence, in step S 7 , all the output color values 
corresponding to the 2 bit tag entry for that line are read from external 
memory into the data cache 230. If the 2 bit tag corresponds to these two 
most significant bits of the 9 bit addresses, then the data cache controller 240 
retrieves in step S 8 the eight output color values CV(P0)-CV(P7) in the 
manner discussed previously with respect to Fig. 62. In this way, the eight 
output color values CV(P0)-CV(P7) surrounding the input pixel are retrieved 
by the main data path unit 242 from the data cache 230. In step S 7 , the 
main data path unit 242 interpolates the output color values CV(P0)-CV(P7) 
utilising the fractional values determined in step S 2 and outputs the 
interpolated output color values. 
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It will be evident to the man skilled in the art, that the storage space 
of the data cache storage may be reduced further by dividing the RGB color 
space and the corresponding output color values into more than four 
quadrants, for example 32 blocks. In the latter case, the data cache can have 
the capacity of storing only a 1/32 block of output color values. 

It will also be evident to the man skilled in the art, that the data 
caching arrangement utilized in the MOGCS mode can also be used in a single 
output general conversion mode. Hence, in the latter mode the storage space 
of the data cache can also be reduced. 
3.17.3JPEG Coding/Decoding 

It is well known that a large number of advantages can be obtained 
from storing images in a compressed format especially in relation to the 
saving of memory and the speed of transferring images from one place to 
another. Various popular standards have arizen for image compression. 
One very popular standard is the JPEG standard and for a full discussion of 
the implementation of this standard reference is made to the well known text 
JPEG: Still Image Data Compression Standardby Pennebaker and Mitchell 
published 1993 by Van Nostrand Reinhold. The co-processor 224 utilizes a 
subset of the JPEG standard in the storage of images. The JPEG standard 
has the advantage that large factor compression can be gained with the 
retention of substantial image quality. Of course, other standards for storing 
compressed images could be utilized. The JPEG standard is well-known to 
those skilled in the art, and the various JPEG alternative implementations 
readily available in the marketplace from manufacturers including JPEG core 
products for incorporation into ASICS. 

The co-processor 224 implements JPEG compression and 
decompression of images consisting of 1, 3 or 4 color components. One-color- 
component images may be meshed or unmeshed. That is, a single-color- 
component can be extracted from meshed data or extracted from unmeshed 
data. An example of meshed data is three-color components per pixel datum 
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(i.e., RGB per pixel datum), and an example of unmeshed data is where each 
color component for an image is stored separately such that each color 
component can be processed separately. For three color component images 
the co-processor 224 utilizes one pixel per word, assuming the three color 
channels to be encoded in the lowest three bytes. 

The JPEG standard decomposes an image into small two dimensional 
units called minimum coded units (MCU). Each minimal coded unit is 
processed separately. The JPEG coder 241 (Fig. 2) is able to deal with 
MCU's which are 16 pixels wide and 8 pixels high for down sampled images or 
MCU's which are 8 pixels wide and 8 pixels high for images that are not to be 
down sampled. 

Turning now to Fig. 66, there is illustrated the method utilized for 
down sampling three component images. 

The original pixel data 600 is stored in the MUV buffer 250 (Fig. 2) in 
a pixel form wherein each pixel 601 comprizes Y, U and V components of the 
YUV color space. This data is first converted into a MCU unit which 
comprizes four data blocks 601 - 604. The data blocks comprize the various 
color components, with the Y component being directly sampled 601, 602 and 
the U and V components being sub-sampled in the particular example of Fig. 
13 to form blocks 603, 604. Two forme of sub-sampling are implemented by 
the co-processor 224, including direct sampling where no filtering is applied 
and odd pixel data is retained while even pixel data is discarded. 
Alternatively, filtering of the U and V components can occur with averaging of 
adjacent values taking place. 

An alternative form of JPEG sub-sampling is four color channel sub- 
sampling as illustrated in Fig. 67. In this form of sub-sampling, pixel data 
blocks of 16 x 8 pixels 610 each have four components 611 including an opacity 
component (O) in addition to the usual Y, U, V components. This pixel data 
410 is sub-sampled in a similar manner to that depicted in Fig. 66 
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However, in this case, the opacity channel is utilized to form data 
blocks 612, 613. - 

Turning now to Fig. 68, there is illustrated the JPEG coder 241 of Fig. 
2 in more detail. The JPEG encoder/decoder 241 is utilized for both JPEG 
encoding and decoding. The encoding process receives block data via bus 620 
from the pixel organizer 246 (Fig. 2). The block data is stored within the 
MUV buffer 250 which is utilized as a block staging area. The JPEG 
encoding process is broken down into a number of well defined stages. These 
stages include: 

1. taking a discrete cosine transform (DCT) via DCT unit 621; 

2. quantising the DCT output 622; 

3. placing the quantized DCT co-efficients in a zig zag order, also 
carried out by quantizer unit 622; 

4. predictively encoding the DC DCT co-efficients and run length 
encoding the AC DCT co-efficients carried out by co-efficient coder 623; and 

5. variable length encoding the output of the co-efficients coder 
stage, carried out by Huffman coder unit 624. The output is fed via 
multiplexer 625 and Rhus 626 to the result organizer 629 (Fig. 2). 

The JPEG decoding process is the inverse of JPEG encoding with the 
order of operations reversed. Hence, the JPEG decoding process comprizes 
the steps of inputting on Bus 620 a JPEG block of compressed data. The 
compressed data is transferred via Bus 630 to the Huffman coder unit 624 
which Huffman decodes data into DC differences and AC run lengths. Next, 
the data is forwarded to the co-efficients coder 623 which decodes the AC and 
DC co-efficients and puts them into their natural order. Next, the quantizer 
unit 622 dequantizes the DC oo-efflcients by multiplying them by a 
corresponding quantization value. Finally, the DCT unit 621 applies an 
inverse discrete cosine transform to restore the original data which is then 
transferred via Bus 631 to the multiplexer 625 for output via Bus 626 to the 
Result Organizer, The JPEG coder 241 operates in the usual manner via 
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standard CBus interface 632 which contains the registers set by the 
instructions controller in order to begin operation of the JPEG coder. Further, 
both the quantizer unit 622 and the Huffman coder 624 require certain tables 
which are loaded in the data cache 230 as required. The table data is 
accessed via an OBus interface unit 634 which connects to the operand 
organizer B unit 247 (Fig. 2) which in turn interacts with the data cache 
controller 240. 

The DCT unit 621 implements forward and inverse discrete cosine 
transforms on pixel data. Although many different types of DCT 
transforming implementations are known and discussed in the Still Image 
Data Compression Standard (ibid), the DCT 621 implements a high speed 
form of transform more fully discussed in the section herein entitled A Fast 
DCT Apparatus, which may implement a DCT transform operation in 
accordance with the article entitled A Fast DCT-SQ Scheme for Images by 
Arai et. al, published in The Transactions of the 1EICE, Vol E71, No. 11, 
November 1988 at page 1095. 

The quantizer 622 implements quantization and dequantization of 
DCT components and operates via fetching relevant values from 
corresponding tables stored in the data cache via the OBus interface unit 634. 
During quantization, the incoming data stream is divided by values read from 
quantization tables stored in the data cache. The division is implemBnted 
as a fixed point multiply. During dequantization, the data stream is 
multiplied by values kept in the dequantization table. 

Turning to Fig. 69, there is illustrated the dequantizer 622 in more 
detail The quantizer 622 includes a DCT interface 640 responsible for 
passing data to and receiving data from the DCT module 621 via a local Bus. 
During quantization, the quantizer 622 receives two DCT co-efficients per 
clock cycle. These values are written to one of the quantizers internal buffers 
641, 642. The buffers 641, 642 are dual ported buffers used to buffer 
incoming data. During quantization, co-efficient data from the DCT sub- 
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module 621 is placed into one of the buffers 641, 642. Once the buffer is full, 
the data is read from the buffer in a zig zag order and multiplied by multiplier 
643 with the quantization values received via OBus interface unit 634. The 
output is forwarded to the co-efficient coder 623 (Fig. 68) via co-efficient coder 
interface 645. While this is happening, the next block of co-efficients is being 
written to the other buffer. During JPEG decompression, the quantizer 
module dequantizes decoded DCT co-efficients by multiplying them by values 
stored in the table. As the quantization and dequantization operations are 
mutually exclusive, the multiplier 643 is utilized during quantization and 
dequantization. The position of the co-efficient within the block of 8x8 values 
is used as Lhe index into the dequantization table. 

As with quantization, the two buffers 641, 642 are utilized to buffer 
incoming co-efficient data from the co-efficient coder 623 (Fig. 68). The data 
is multiplied with its quantization value and written into the buffers in 
reverse zig zag order. Once full, the dequantized co-efficients are read out of 
the utilized buffer in natural order, two at a time, and passed via DOT 
interface 640 to the DCT sub-module 621 (Fig. 68). Hence the co-efficients 
coder interface module 645 is responsible for interfacing to the co-efficients 
coder and passes data and receives data from the coder via a local Bus. This 
module also reads data from buffers in zig zag order during compression and 
writes data to the buffers in reverse zig zag order during decompression. 
Both the DCT interface module 640 and the CC interface module 645 are able 
to read and write from buffers 64 1; 642. Hence, address and control 
multiplexer 647 is provided to select which buffer each of these interfaces is 
interacting with under the control of a control module 648, which comprizes a 
state machine for controlling all the various modules in the quantizer. The 
multiplier 643 can be a 16x8, 2's complement multiplier which multiplies 
DCT co-efficients by quantization table values. . 

Turning again to Fig. 68, the co-efficient coder 623 performs the 
functions of: 
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(a) predictive encoding/decoding of DC co-efficients in JPEG mode; 

and 

(b) run length encoding/decoding of AC co-efficients in JPEG mode. 
Preferably, the co-efficient coder 623 is also able to be utilized for 

predictive encoding/decoding of pixels and memory copy operations as required 
independently of JPEG mode operation. The co-efficient coder 623 
implements predictive and run length encoding and decoding of DC and AC co- 
efficients as specified in the Pink Book. A standard implementation of 
predictive encoding and predictive decoding in addition to JPEG AC co- 
efficients run lengthing encoding and decoding as specified in the JPEG 
standard is implemented. 

The Huffman coder 624 is responsible for Huffman encoding and 
decoding of the JPEG data train. In Huffinan encoding mode, the run length 
encoded data is received from the co-efficients coder 623 and utilized to 
produce a Huffman stream of packed bytes. Alternatively, or in addition, in 
Huffman decoding, the Huffman stream is read from the PBus interface 620 
in the form of packed bytes and the Huffman decoded co-efficients are 
presented to the co-efficient coder module 623. The Huffman coder 624 
utilizes Huffman tables stored in the data cache and accessed via OBus 
interface 634. Alternatively, the Huffman table can be hardwired for 
maximum speed. 

When utilising the data cache for Huffman coding, the eight banks of 
the data cache store data tables as follows with the various tables being 
described in further detail hereinafter. 
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Table 14: Huffman and Quantization Tables as stored in Data 



Cache 



Bank 


Description 


0 


This bank hold the 256, 16 bit entries of a 
EHUFCO_DC_l or EHUFCO table. The least 
significant bit of the index chooses between the two 16 
bit items in the 32 bit word. All 128 lines of this bank 
of memory are used. 


1 


This bank holds the 256, 16 bit entries of a 
EHUFCO_DC_2 table. The least significant bit of the 
index chooses between the two 16 bit items in the 32 bit 
word. All 128 lines of this bank of memory are used. 


2 


This bank holds the 256, 16 bit entries of a 
EHUFCCLAC_1 table. The least significant bit of the 
index chooses between the two 16 bit items in the 32 bit 
word. Al 128 lines of this bank of memory are used 


3 


This bank holds the 256, 16 bit entries of a 
EHUFCO^AC_2 table. The least significant bit of the \ 
index chooses between the two 16 bit items in the 32 bit 
word. All 128 lines of this bank of memory are used. 


4 


This bank holds the 256, 4 bit entires of a 
EHUFSI_DC_1 or EHUFSI table, as well as the 256, 4 
bit entires of a EHUFSI_DC_2 table. Ail 128 lines of 
this bank of memory are used. 


c 

o 


inis DanK nolas trie zoo, 4 bit entries ot a 
EHUFSI_AC_1 table, as well as the 256, 4 bit entries of 
a EHUFSI_AC_2 table. All 128 lines of this bank of 
memory are used. 


6 


Not used 


7 


This banks holds the 128, 24 bit entries of the 
quantization table. It occupies the least significant 3 
bytes of all 128 lines of this bank of memory, 



Turning now to Fig, 70, the Huffman coder 624 consists primarily of 
5 two independent blocks being an encoder 660 and a decoder 66 1. Both blocks 
660,661 share the same OB us interface via a multiplexer module 662. Each 
block has its own input and output with only one block active at a time, 
depending on the function performed by the JPEG encoder. 
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a. Encoding 

During encoding in JPEG mode, Huffman tables are used to assign 
codes of varying lengths (up to 16 bits per code) to the DC difference values 
and to the AC run-length values, which are passed to the HC submodule from 
5 the CC submodule. These tables have to be preloaded into the data cache 
before the start of the operation. The variable length code words are then 
concatenated with the additional bits for DC and AC co-efficients (also passed 
from the CC submodule, then packed into bytes. A X'OO byte is stuffed in if an 
XTTbyte is obtained as a result of packing. If there is a need for an RST™ 
10 marker it is inserted. This may require byte padding with w l" bits of the 
last Huffman code and X'OO byte stuffing if the padded byte results in X r FF. 
The need for an RST m marker is signalled by the CC submodule. The HC 

submodule inserts the EOI marker at the end of image, signalled by the 

"final" signal on the PBus-CC slave interface. The insertion procedure of 
15 the EOI marker requires similar packing, padding and stuffing operations as 

for RST m markers. The output stream is finally passed as packed bytes to 

the Result Organizer 249 for writing to external memory. 

In non-JPEG mode data is passed to the encoder from the CC 

submodule (PBus-CC slave interface) as unpacked bytes. Each byte is 
20 separately encoded using tables preloaded into the cache (similarly to JPEG 

mode), the variable length symbols are then assembled back into packed 

bytes and passed to the Results Organizer 249. The very last byte in the 

output stream ia padded with l's. 

b. Decoding 

25 Two decoding algorithms are implemented: fast (real time) and slow 

(versatile). The fast algorithm works only in JPEG mode, the versatile one 
works both in JPEG and non-JPEG modes. 

The fast JPEG Huffman decoding algorithm maps Huffman symbols 
to either DC difference values or AC run-length values. It is specifically 

30 tuned for JPEG and assumes that the example Huffman tables (K3, K4, K5 
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and K6) were used during compression. The same tables are hard wired in to 
the algorithm allowing decompression without references to the cache memory. 
This decoding style is intended to be used when decompressing images to be 
printed where certain data rates need to be guaranteed. The data rate for 
the HC sub module decompressing a band (a block between RST m markers) is 
almost one DC/AC co-efficient per clock cycle. One clock cycle delay between 
the HC submodule and CC sub-module may happen for each XOO stuff byte 
being removed from the data stream, however this is strongly data dependent. 

The Huffman decoder opemteR in n faster mode for the extraction of 
one Huffman symbol per clock cycle. The fast Huffman decoder is described 
in the section herein entitled Decoder of Variable Length Codes, 

Additionally, the Huffman decoder 661 also implements a heap-based 
slow decoding algorithm and has a structure 670 as illustrated in Fig. 71. 

For a JPEG encoded stream, the STRIPPER 671 removes the X'OO 
stuff bytes, the X'FF fill bytes and RST m markers, passing Huffman symbols 

with concatenated additional bits to the SHIFTER 672. This stage is 
bypassed for Huffman-only coded streams. 

The first step in decoding a Huffman symbol is to look up the 256 
entries HUFVAL table stored in the cache addressing it with the first 8 bits 
of the Huffman data stream. If this yields a value (and the true length of the 
corresponding Huffman symbol), the value is passed on to the OUTPUT 
FORMATTER 676, and the length of the symbol and the number of the 
additional bits for the decoded value are fed back to the SHIFTER 672 
enabling it to pass the relevant additional bits* to the OUTPUT FORMATTER 
670 and align the new front of the Huffman stream presented to the decoding 
unit 673. The number of the additional bits is a function of the decoded 
value. IF the first look up does not result in a decoded value, which means 
that the Huffman symbol is longer than 8 bits, the heap address is calculated 
and successive heap (located in the cache, too) accesses are performed 
following the algorithm until a match is found or an "illegal Huffman symbol" 
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condition met. A match results in identical behavior as in case of the first 
match and "illegal Huffman symbol" generates an interrupt condition. 

The algorithm for heap-based decoding algorithm is as follows: 

loop until end of image 

set symbol length N to 8 

get first 8 bits of the input stream into INDEX 
fetch HUFVALflNDEX) 

if HUFVAL(INDEX) — OOxx 0000 111 - (ILL) 
signal "illegal Huffman symbol" 
exit 

elsif HUFVALflNDEX) = lnnn eeee eeee - (HIT) 
pass nnn bits to eeee eeee as the value 
pass symbol length N = decimal (nnn)/* 000 
as symbol length 8*/ 
adjust the input stream 
break 

else/* HUFVAL (INDEX) = Oliii iiii iiii (MISS)*/ 

set HEAPINDEX ~ ii iiii iiii -- (we assume heapbase = 0) 
eel N = 9 

if 9th bit of the input stream = 0 

■<, 

increment HEAPINDEX 

fi 

fetch VALUE = HEAP (HEAPINDEX) -- (code for 9th bit) • 
loop 

if VALUE = 0001 0000 111 1 -- (ILL) 
signal "illegal Huffman symbol" 
exit 

elsif VALUE = 1000 eeee eeee 
pass eeee eeee as the value 
pass symbol length N 
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adjust the input stream 
break 

else/* VALUE = Oliii iiii iiii - (MISS) */ 

set N = N + 1 - (HEAPINDEX = ii iiii iiii) 
if Nth bit of the input stream = 0 
increment HEAPINDEX 

fi 

fetch VALUE = HEAP (HEAPINDEX) 

pool 

pool 

The STRIPPER 671 removes any X00 stuff bytes, XTF fill bytes and 
RST m markers from the incoming JPEG 671 coded stream and passes "clean" 

Huffman symbols with concatenated additional bits to the shifter 672. 
There arc no additional bits in Huffman-only encoding, so in this mode the 
passed stream consists of Huffman symbols only. 

The shifter 672 block has a 16 bit output register in which it presents 
the next Huffman symbol to the decoding unit 673 (bitstream running from 
MSB to LSB). Often the symbol is shorter than 16 bits, but it is up to the 
decoding unit 673 to decide how many bits are currently being analysed. The 
shifter 672 receives a feedback 678 from the decoding unit 673, namely the 
length of the current symbol and the length of the following additional bits for 
the current symbol (in JPEG mode), which allows for a shift and proper 
alignment of the beginning of the next symbol in the shifter 672. 

The decoding unit 673 implements the core of the heap based 
algorithm and interfaces to the data cache via the OBus 674. It incorporates 
a Data Cache fetch block, lookup value comparator, symbol length counter, 
heap index adder and a decoder of the number of the additional bite (the 
decoding is based on the decoded value). The fetch address is interpreted as 
follows: 
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Table 15: Fetch Address 



Field (bits) 


Description 


[32:25] 


Index into dequantization tables. 


[24:191 


Not used. 


[18:9] 


Index into the heap. 


[8:0] 


Index into Huffman decode table. 



The OUTPUT FORMATTER block 676 packs decoded 8-bit values 
(standalone Huffman mode), or packs 24-bit valu© +■ additional bits + RST m 

marker information (JPEG mode) into 32-bit words. The additional bits are 
passed to the OUTPUT FORMATTER 676 by the shifter 672 after the 
decoding unit 673 decides on the start position of the additional bits for the 
current symbol. The OUTPUT FORMATTER 673 also implements a 2 deep 
FIFO buffer using a one word delay for prediction of the final value word. 
During the decoding process, it may happen that the shifter 672 (either fast or 
slow) tries to decode the trailing padding bits at the end of the input 
bitstream. This situation is normally detected by the shifter and instead of 
asserting the "illegal symbol" interrupt, it asserts a "force final" signal. 
Active "force final" signal forces the OUTPUT FORMATTER 676 to signal the 
last but one decoded word as "final" (this word is still present in the FIFO) 
and discard the very last word which does not belong to the decoded stream. 

The Huffman encoder 660 of Fig. 70 is illustrated in Fig. 72 in more 
.detail. The Huffman encoder 660 maps byte data into Huffinan symbols via 
look up tables and includes a encoding unit 681, a shifter 682 and a OUTPUT 
FORMATTER 683 with the lookup tables being accessed from the cache. 

Each submitted value 685 is coded by the encoding unit 681 using 
coding tables stored in the data cache. One access to the cache 230 is needed 
to encode a symbol, although each value being encoded requires two tables, 
one that contains the corresponding code and the other that contains the code 
length. During JPEG compression, a separate set of tables is needed for AC 
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and DC co-efficients. If subsampling is performed, separate tables are 
required for subsampled and non subsampled components. For non-JPEG 
compression, only two tables (code and size) are needed. The code is then 
handled by the shifter 682 which assembles the outgoing stream on bit level. 
The Shifter 682 also performs RST m and EOI markers insertion which 
implies byte padding, if necessary. Bytes of data are then passed to the 
OUTPUT FORMATTER 683 which does stuffing (with X'OO bytes), filling with 
X'FF bytes, also the FF bytes leading the marker codes and formatting to 
packed bytes. In the non-JPEG mode, only formatting of packed bytes is 
required. 

Insertion of X'FF bytes is handled by the snifter 682, which means 
that the output formatter 683 needs to tell which bytes passed from the 
shifter 682 represent markers, in order to insert an X'FF byte before. This is 
done by having a register of tags which correspond to bytes in the shifter 682. 
Each marker, which must be on byte boundaries anyway, is tagged by the 
shifter 682 during marker insertion. The packer 683 does not insert stuff 
bytes after the X"FF" bytes preceding the markers. The tags are shifted 
synchronously with the main shift register. 

The Huffman encoder uses four or eight tables during JPEG 
compression, and two tables for straight Huffman encoding. The tables 
utilized are as follows: 
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Table 16: Tables Used by the Huffman Encoder 



Name 


Size 


Dp rpt infirm 


EHUFSI 


256 


Huffman code sizes. Used during 
straight Huffman encoding. Uses the 
coded value as an index. 


EHUFCO 


256 


Huffman code values used during 
straight Huffman encoding. Uses the 
coded value as an index. 




X D 


xiuinnan codes sizes used to code DU 
co-efficients during JPEG compression. 
Uses magnitude category as the index. 


EHUFCOJDC_ 

A 


16 


Huffman code values used to code DC 
co-efficients during JPEG compression. 
Uses magnitude category as an index. 
Used for subsampled blocks. 


EHUFSI_DC_2 


16 


Huffman code sizes used to code DC co- 
efficients during JPEG compression. B 
Uses magnitude category as an index. 
Used for subsampled blocks. 


EHUFCO_DC_ 
2 


16 


Huffman code sizes used to code DC 
co-efficients during JPEG compression. 
Uses magnitude category as an index. 
Used for subsampled blocks. 
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Name Size Description 


JiiJl U r ol_AU_l 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression. 
Uses magnitude category and run- 
length as an index. 


EHUFCO_AC_ 
1 


256 


Huflman code sizes used to code AC co- 
efficients during JPEG compression. 
Uses magnitude category and run- 
length as an index. 


EHUFSI_AC_2 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression for 

ou.uoaiu.picu. t>Ull_l^JiJilfc:H La . USGS 

magnitude category and run-length as 
an index. 


EHUFCO_AC_ 
2 


256 


Huflman code sizes used to code AC co- 
efficients during JPEG compression for 
subsampled components. Uses 
magnitude category and run-length as 
an index. 



3. 17.4 Table Indexing 

Huffman tables are stored locally by the co-processor data cache 230. 
The data cache 230 is organized as a 128 line, direct mapped cache, where 
each line comprizes 8 words. Each of the words in a cache line are separately 
addressable, and the Huffman decoder uses this feature to simultaneously 
access multiple tables. Because the tables are small (<= 256 entries), the 32 
bit address field of the OBus can cany indexes into multiple tables. 

As noted previously, in JPEG slow decoding mode, the data cache is 
utilized for storing various Huffman tables. The format of the data cache is 
as follows: 
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Table 17: Bank Address for Huffman and Quantization Tables 



Bank 


Description 


0 to 3 


These banks hold the 1024, 16 bit entries of the heap. 
The least significant index bit selects between the two 
16 bit words in each bank. All 128 lines of the four 
banks of memory are used. 






L Bank 


Description 


4 


This bank holds the 512, least significant 8 bits of the 
12 bit entries of the DC Huffman decode table. The 
least significant two bits of the index chooses between 
me lour, oyre lienib in tne oz oil word. All 128 line of 
this bank of memory are used. 


5 


This bank holds the 512, least significant 8 bits of the 
12 bit entires of the AC Huffman decode table. The 
least sigmncant uwo oils 01 tne index cnooses between 
the four, byte items in the 32 bit word. All 128 lines 
of this bank of memory are used. 


6 


This bank holds the most significant 4 bits of both the 
DC and AC Huffman decode tables. The least 
significant 2 bits of each index chooses between the 4 
respective nibbles within each word. 


7 


This bank holds the 128, 24 bit entires of the "1 
quantization table. It occupies the least significant 3 
bytes of all 128 lines of this bank of memory. 



Prior to each JPEG instruction being executed by the JPEG coder 241 
(Fig. 2) the appropriate image width value in the image dimensions register 
(POJDR) or (ROJDR) must be set. As with other instructions, the length of 
- the instruction refers to the number of input data items to be processed. 
This includes any padding data and accounts for any sub-sampling options 
utilized and for the number of color channels used. 

All instructions issued by the co-processor 224 may utilize two 
facilities for limiting the amount of output data produced. These facilities 
are most useful for instructions where the input and output data sizes are not 
the same and in particular where the output data size is unknown, such as for 
JPEG coding and decoding. The facilities determine whether the output data 
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is written out or merely discarded with everything else being as if the 
instruction was properly processed. By default, these facilities are normally 
disabled and can be enabled by enabling the appropriate bits in the RO_CFG 
register. JPEG instructions however, include specific option for setting these 
bits. Preferably, when utilising JPEG compression, the co-processor 224 
provides facilities for "cutting" and "limiting" of output data. 

Turning to Fig. 73, there is now described the process of cutting and 
limiting. An input image 690 may be of a certain height 691 and a certain 
width 692. Often, only a portion of the image is of interest with other 
portions being irrelevant for the purposes of printing out. However, the JPEG 
encoding system deals with 8x8 blocks of pixels. It may be the case that, 
firstly, the image width is not an exact multiple of 8 and additionally, the 
section of interest comprising MCU 695 does not fit across exact boundaries. 
An output cut register, ROjsut specifies the number of output bytes at 696 at 
the beginning of the output data stream to discard. Further, an output limit 
register, RO_LMT specifies the maximum number of output bytes to be 
produced. This count includes any bytes that do not get written to memory 
as a result of the cut register. Hence, it is possible to target a final output 
byte 698 beyond which no data is to be outputted. 

There are two particular cases where the cut and limited functionality 
of the JPEG decoder is considered to be extremely useful. The first case, as 
illustrated in Fig. 74, is the extraction or decompression of a sub-section 700 
of one strip 701 of a decompressed image. The second useful case is 
illustrated in Fig. 75 wherein the extraction or decompression of a number of 
complete strips (eg. 711, 712 and 713) is required from an overall image 714. 

The instruction format and field encoding for JPEG instructions is as 
illustrated in Fig. 76. The minor opcode fields are interpreted as follows: 
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Table 18: Instruction Word - Minor Opcode Fields 



Field 


1 Deecrintion 


D 


0 = enoodefcompress) 

1 — decode (As co m nrp^Q^ 


M 


0 - single color channel 

1 = multi channel 


4 


0 - three channel 

1 = four channel 


S 


0 = do not use a sub/up sampling regime ~ 

1 — use a subsamnlinfir remmp 


H 


0 = use fast Huffman coding 

1 = use general purpoae Huffman coding 


C 


0 = do not use cut register 

1 = use cut register 


T 


0 = do not truncate on output 

1 = truncate on output 


F 


0 = do not low pass filter before subsampling 

1 - low pass filter before subsampling 



3. 17. 5 Data Coding Instructions 

Preferably, the co-proceBsor 224 provides for the ability to utilize 
portions of the JPEG coder 241 of Fig. 2 in other ways. For example, 
Huffman coding is utilized for both JPEG and many other methods of 
compression. Preferably, there is provided data coding instructions for 
manipulating the Huffman coding unit only for hierarchial image 
decompression. Further, the run length coder and decoder and the predictive 
coder can also be separately utilized with similar instructions. 

3.17.6 A Fast DCT Apparatus 

Conventionally, a discrete cosine transform (DCT) apparatus as shown 
in Fig, 77 performs a full two-dimensional (2-D) transformation of a block of 
8x8 pixels by first performing a 1-D DCT on the rows of the 8x8 pixel block. 
It then performs another 1-D DCT on the columns of the 8x8 pixel block. 
Such an apparatus typically consists of an input circuit 1096, an arithmetic 
circuit 1104, a control circuit 1098, a transpose memory circuit 1090, and an 
output circuit 1092. 
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The input circuit 1096 accepts 8-bit pixels from the 8x8 block. The 
input circuit 1096 is coupled by intermediate multiplexers 1100, L102 to the 
arithmetic circuit 1004. The arithmetic circuit 1104 performs mathematical 
operations on either a complete row or column of the 8x8 block. The control 
circuit. 1098 controls all the other circuits, and thus implements the DCT 
algorichm. The output of the arithmetic circuit is coupled to the transpose 
memory 1090, register 1095 and output circuit 1092. The transpose memory 
is in turn connected to multiplexer 1100, which provides output to the next 
multiplexer 1102. The multiplexer 1102 also receives input from the register 
1094. The transpose circuit 1090 accepts 8x8 block data in rows and 
produces that data in columns. The output circuit 1092 provides the 
coefficients of the DCT performed on a 8x8 block of pixel data. 

In a typical DCT apparatus, it is the speed of the arithmetic circuit 1 104 
that basically determines the overall speed of the apparatus, since the 
arithmetic circuit 1104 is the most complex. 

The arithmetic circuit 1104 of Fig. 77 is typically implemented by 
breaking the arithmetic process down into several stages as described 
hereinafter with reference to Fig. 78. A single circuit is then built that 
implements each of these stages 1114, 1148, 1152 : 1156 using a pool of 
common resources, such as adders and multipliers. Such a circuit 1104 is 
mainly disadvantageous due to it being slower than optimal, because a single, 
common circuit is used to implement the various stages of circuit 1104. This 
includes a storage means used to store intermediate results. Since the time 
allocated for the clock cycle of such a circuit must be greater or equal to the 
time of the slowest stage of the circuit, the overall time is potentially longer 
than the sum of all the stages. 

Fig. 78 depicts a typical arithmetic data path, in accordance with the 
apparatus of Fig. 77, as part of a DCT with four stages. The drawing docs 
not reflect the actual implementation, but instead reflects the functionality. 
Each of the four stages 1144, 1148, 1152, and 1166 is implemented using a 
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single, reconfigurable circuit. It is reconfigured on a cycle -by-cycle basis to 
implement each of the four arithmetic stages 1144, 1148, 1152, and 1156 of 
the 1-D DCT. In this circuit, each of the four stages 1144, 1148, 1152, and 
1156 uses pool of common resources (e.g. adders and multipliers) and thus 
5 minimises hardware. 

However, the disadvantage of this circuit is that it is slower than 
optimal. The four stages 1144, 1148, 1152, and 1156 are each 
implemented from the same pool of adders and multipliers. The period of 
the clock is therefore determined by the speed of the slowest stage, which in 

10 this example is 20 ns (for block 1144). Addingin the delay (2ns each) of the 
input and output multiplexers 1146 and 1154 and the delay (3ns) of the flip- 
flop 1150, the total time is 27 ns. Thus, the fastest this DCT 
implementation can run at is 27 ns. 

Pipelined DCT implementations are also well known. The drawback 

15 with such implementations is that they require large amounts of hardware to 
implement. Whilst the present invention does not offer the same 
performance in terms of throughput, it offers an extremely good 
performance/size compromise, and good speed advantages over most of the 
current DCT implementations. 

20 Fig. 79 shows a block diagram of the preferred form of discrete cosine 

transform unit utilized in the JPEG coder 241 (Fig. 2) where pixel data is 
inputted to an input circuit 1126 which captures an entire row of 8-bit pixel 
data. The transpose memory 1118 converts row formatted data into column 
formatted data for the second pass of the two dimensional discrete cosine 

25 transform algorithm. Data from the input circuit 1126 and the transpose 
memory 11 18 is multiplexed by multiplexor 1124, with the output data from 
multiplexer 1124 presented to the arithmetic circuit 1122. Results data 
from the arithmetic circuit 1122 i6 presented to the output circuit 1120 after 
the second pass of the process, The control circuit 1116 controls the flow of 

30 data through the discrete cosine transform apparatus. 
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During the first pass of the discrete cosine transform process row data 
from the image to be transformed, or transformed image coefficients to be 
transformed back to pixel data is presented to the input circuit 1 126, During 
this first pass, the multiplexer 1124 is configured by the control circuit 1116 
to pass data from the input circuit 1126 to the arithmetic circuit 1122. 

Turning to Fig. 80, there is shown the structure of the arithmetic 
circuit 1122 in more detail. In the case of performing a forward discrete cosine 
transform, the results from the forward circuit 1138 which is utilized to 
calculate the forward discrete cosine transform is selected via the multiplexer 
1142, which is configured in this way by the control circuit 1116. When an 
inverse discrete cosine transform is to be performed, the output from the 
inverse circuit 1140 is selected via the multiplexer 1142, as controlled by the 
control circuit 1126. During the first pass, after each row vector has been 
processed by the arithmetic circuit 1122 (configured in the appropriate way by 
control circuit 1116), that vector is written into the transpose memory 1118. 
Once all eight row vectors in an 8x8 block have been processed and written 
into the transpose memory 1118, the second pass of the discrete cosine 
transform begins. 

During the second pass of either the forward or inverse discrete cosine 
transforms, column ordered vectors are read from the transpose memory 1118 
and presented to the arithmetic circuit 1122 via the multiplexer 1124. 
During this second pass, the multiplexer 1124 is configured by the control 
circuit to ignore data from the input circuit 1136 and pass column vector data 
from the transpose memory 1118 to the arithmetic circuit 1122, The 
multiplexer 1142 in the arithmetic circuit 1122 is configured by the control 
circuit 1116 to pass results data from the inverse circuit 1140 to the output of 
the arithmetic circuit 1122. When results from the arithmetic circuit 1122 
are available, they are captured by the output circuit 1120 under direction 
from the control circuit 1116 to be outputted sometime later. 
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The arithmetic circuit 1122 is completely combinatorial, in that is 
there are no storage elements in the circuit storing intermediate results. The 
control circuit 1116 knows how long it takes for data to flow from the input 
circuit 1136, through the multiplexer 1124 and through the arithmetic circuit 
1122, and so knows exactly when to capture the results vector from the 
outputs of the arithmetic circuit 1122 into the output circuit 1120. The 
advantage of having no intermediate stages in the arithmetic circuit 1122 is 
that no time is wasted getting data in and out of intermediate storage 
elements, but also the total time taken for data to flow through the arithmetic 
circuit 1122 is equal to the sum of all the internal stages and not N times the 
delay of the longest stage (as with conventional discrete cosine transform 
implementations), where N is the number of stages in the arithmetic circuit. 

Referring to Fig. 81, the total time delay is simply the sum of the four 
stage 1158, 1160, 1162, 1164, which is 20 ns + 10 ns + 12 ns + 15 ns = 57 ns, 
which is faster that the circuit depicted in Fig. 78. The advantage of this 
circuit is that it provides an opportunity to reduce the overall system's clock 
period. Assuming that four clock cycles are allocated to getting a result from 
the circuit depicted in Fig. 81, the fastest run time for the entire DCT system 
would be 57/4 ns (14.25 ns), which is a significant improvement over the 
circuit in Fig. 78 which only allows for a DCT clock period of substantially 27 
ns. 

An examplary implementation of the present DCT apparatus might, 
but not necessarily, use the DCT algorithm proposed in the paper to The 
Transactions of the IEICE, Vol. E 71. No. 11, November 1988, entitled A Fast 
DCT-SQ Scheme for Images at page 1095 by Yukihiro Arai, Takeshi Agui and 
Masayuki Nakajima. By implementing this algorithm in hardware, it can 
then easily be placed in the current DCT apparatus in the arithmetic circuit 
1122. Likewize, other DCT algorithms may be implemented in hardware in 
place of arithmetic circuit 1 122. 
3.17.7 Huffman Decoder 
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The aspects of the following embodiment relate to a method and 
apparatus for variable-length codes interleaved with variable length bit fields. 
En particular, the embodiments of the invention provide efficient and fast, 
single stage (clock cycle) decoding of variable -length coded data in which byte 
aligned and not variable length encoded data is removed from the encoded 
data stream in a separate pre-processing block. Further, information about 
positions of the removed byte-aligned data is passed to the output of the 
decoder in a way which is synchronous with the data being decoded. In 
addition, it provides fast detection and removal of not byte-aligned and not 
variable length encoded bit fields that are still present in the pre-processed 
input data. 

The preferred embodiment of the present invention preferably provides 
for a fast Huffman decoder capable of decoding a JPEG encoded data at a rate 
of one Huffman symbol per clock cycle between marker codes. This is 
accomplished by means of separation and removal of byte aligned and not 
Huffman encoded marker headers, marker codes and stuff bytes from the 
input data first in a separate pre-processing block. After the byte aligned 
data is removed, the input data is passed to a combinatorial data-shifting 
block, which provides continuous and contiguous filling up of the data decode 
register that consequently presents data to a decoding unit. Positions of 
markers removed from the original input data stream are passed on to a 
marker shifting block, which provides shifting of marker position bits 
synchronously with the input data being shifted in the data shifting block. 

The decoding unit provides combinatorial decoding of the encoded bit 
field presented to its input by the data decode register. The bit field is of a 
fixed length of n bits. The output of the decoding unit provides the decoded 
value (v) and the actual length (m) of the input code, where m is less than or 
equal to n. It also provides the length (a) of a variable length bit field, where 
(a) is greater than or equal to 0. The variable-length bit field is not Huffman 
encoded and follows immediately the Huffman code. The n-long bit field 
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presented to the input of the decoding unit may be longer than or equal to the 
actual code. The decoding unit determines the actual length of the code (m) 
and passes it together with the length of the additional bits (a) to a control 
block. The control block calculates a shift value (a+m) driving the data and 
marker shifting blocks to shift the input data for the next decoding qycle. 

The apparatus of the invention can comprise any combinatorial decoding 
unit, including ROM, RAM, PLA or anything else based as long as it provides 
a decoded value, the actual length of the input code, and the length of the 
following not Huffman encoded bit fieldwithin a given time frame. 

In the illustrated embodiment, the decoding unit outputs predictively 
encoded DC difference values and AC run-length values as defined in JPEG 
standard. The not Huffman encoded bit fields, which are extracted from the 
input data simultaneously with decoded values, represent additional bits 
determining the value of the DC and AC coefficients as defined in JPEG 
standard. Another kind of not Huffman encoded bit fields, which are 
removed from the data present in the data decode register, are padding bits 
as defined in JPEG standard that precede byte-aligned markers in the 
original input data stream. These bits am detected by the control block by 
checking the contents of a padding zone of the data register. The padding 
zone comprises up to k most significant bits of the data register and is 
indicated by the presence of a marker bit within k most significant bits of the 
marker register, position of said marker bit limiting the length of the padding 
zone. If all the bits in the padding zone are identical (and equal to Is in case 
of JPEG standard), they are considered as padding bits and are removed from 
the data register accordingly without being decoded. The contents of the data 
and marker registers are then adjusted for the next decoding cycle. 

The exemplary apparatus comprises an output block that handles 
formatting of the outputted data according to the requirements of the 
preferred embodiment of the invention. It outputs the decoded values 
together with the corresponding not variable length encoded bit fields, such as 
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additional bite in JPEG, and a signal indicating position of any inputted byte 
aligned and not encoded bit fields, such as markers in JPEG, with respect to 
the decoded values. 

Data being decoded by the JPEG coder 241 (Fig. 2) is JPEG 
compatible and comprizes variable length Huffinan encoded codes interleaved 
with variable length not encoded bit fields called "additional bits", variable 
length not encoded bit fields called "padding bits" and fixed length, byte 
aligned and not encoded bit fields called "markers", "stuff bytes" and "fill 
bytes". Fig. 82 shows a representative example of input data. 

The overall structure and the data flow in the Huffman decoder of the 
JPEG coder 241 is presented in Fig. 83 and Fig. 84, where Fig 83 illustrates 
the architecture of the Huffinan decoder of the JPEG data in more detail. 
The stripper 1171 removes marker codes (code FFXX hex> XX being non aero), 
fill bytes (code FF hex ) and stuff bytes (code 00 hex following code FF hex) that 
is all byte aligned components of the input data, which are presented to the 
stripper as 32 bit words. The most significant bit of the first word to be 
processed is the head of the input bit stream. In the stripper 1171, the byte 
aligned bit fields are removed from each input data word before the actual 
decoding of Huffman codes takes place in the downstream parts of the 
decoder. 

The input data arrives at the stripper's 1171 input as 32-bit words, 
one word per clock cycle. Numbering of the input bytes 1211 from 0 to 3 is 
shown in Fig. 85. If a byte of a number (i) is removed because it is a fill byte, 
a stuff byte or belongs to a marker, the remaining bytes of numbers (i-1) down 
to 0 are shifted to the left on the output of the stripper 1171 and take 
numbers (i) down to 1. Byte 0 becoming a "don't care" byte. Validity of 
bytes outputted by the stripper 1171 is also coded by means of separate 
output tags 1212 as shown in Fig. 85. The bytes which are not removed by 
the stripper 1 171 are left aligned on the stripper's output. Each byte on the 
output has a corresponding tag indicating if the corresponding byte is valid (i.e. 
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passed on by the stripper 1171), or invalid {i.e. removed by the stripper 1171) 
or valid and following a removed marker. The tags 1212 control loading of 
the data bytes into the data register 1182 through the data shifter and 
loading of marker positions into the marker register 1183 through the marker 
shifter. The same scheme applies if more than one byte is removed from the 
input word: all the remaining valid bytes are shifted to the left and the 
corresponding output tags indicate validity of the output bytes. Pig. 86 
provides examples 1213 of output bytes and output tags for various example 
combinations of input bytes. 

Returning to Fig. 83, the role of the preshifter and postshifter blocks 
1172, 1173, 1180, 1181 is to assure loading of the data into the corresponding 
data register 1182 and marker register 1183 in a contiguous way whenever 
there is enough room in the data register and the marker register. The data 
shifter and the marker shifter blocks, which consist of the respective pre- and 
postshifters, are identical and identically controlled. The difference is that 
while the data shifter handles data passed by the stripper 1171, the marker 
shifter handles the tags only and its role is to pass marker positions to the 
output of the decoder in a way synchronous with the decoded Huffman values. 
The outputs of the postshifters 1180, 1181 feed directly to the respective 
registers 1182, 1183, as shown in Fig. 83. 

In the data preshifter 1172, as also Bhown in Fig. 86, data arriving 
from the stripper 1171 is firstly extended to 64 bits by appending 32 zeroes to 
the least significant bit 1251. Then the extended data is shifted in a 64 bit 
wide barrel shifter 1252 to the right by a number of bits currently present in 
the data register 1182. This number is provided by the control logic 1185 
which keeps track of how many valid bite are there in the data 1182 and 
marker 1 183 registers. The barrel shifter 1252 then presents 64 bits to the 
multiplexer block 1253, which consists of 64 2x1 elementary multiplexers 
1254. Each elementary 2x1 multiplexer 1254 takes as inputs one bit from 
the barrel shifter 1252 and one bit from the data register 1182. It passes the 
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data register bit to the output when this bit is still valid in the data register. 
Otherwize, it passes the barrel shifter's 1252 bit to the output. The control 
signals to all the elementary multiplexers 1254 are decoded from a control 
block's shift control 1 signals as shown in Fig. 86, which are also shown in Fig. 
87 as preshifter control bits (L.5 of register 1223. The outputs of the 
elementary multiplexers 1254 drive a barrel shifter 1255. It shifts left by 
the number of bits provided on a 5 bit control signal shift control 2 as shown 
in Fig 86. These bits represent the number of bits consumed from the data 
register 1182 by the decoding of the current data, which can be either the 
length of the currently decoded Huffman code plus the number of the following 
additional bits, or the number of padding bits to be removed if padding bits 
are currently being detected, or zero if the number of valid data bits in the 
data register 1182 is less then the number of bits to be removed. In this way, 
the data appearing on the output of barrel shifter 1255 contains new data to 
be loaded into the data register 1182 after a single decoding cycle. The 
contents of the data register 1182 changes in such a way that the leading 
(most significant) bits are shifted out of the register as being decoded, and 0, 8, 
16, 24 or 32 bits from the stripper 1171 are added to the contents of the data 
register 1 182 . If there are not enough bits in the data register 1182 to decode 
them, data from the stripper 1171, if available, is still loaded in the current 
cycle. If there is no data available from the stripper 1 171 in the current cycle, 
the decoded bits from the data register 1182 are still removed if there is a 
sufficient amount of them, otherwize the content of the data register 1182 
does not change. 

The marker preshifter 1173, postshifter 1181 and the marker register 
1183 are units identical to the data preshifter 1172, data postshifter 1180 
and the data register 1182, respectively. The data flow inside units 1173, 
1181 and 1183 and among them is also identical as the data flow among 
units 1172, 1180 and 1182. The same control signals are provided to both 
sets of units by the control unit 1185. The difference is only in the type of 
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data on the inputs of the marker preshifter 1 173 and data preshifter 1172, as 
well as in how the contents of the marker register 1183 and the data register 
1182 are used. As shown in Fig. 88 : tags 1261 from the stripper 1171 come 
as eight bit words, which provide two bits for each corresponding byte of data 
going to the data register 1 182. According to the coding scheme shown in Fig. 
85, an individual two bit tag indicating valid and following a marker byte has 
1 on the most significant position. Only this most significant position of each 
of the four tags delivered by the stripper 1171 simultaneously is driven to the 
input 1262 of the marker preshifter 1173. In this way, on the input to the 
marker preshifter there maybe bits set to 1 indicating positions of the first 
encoded data bits following markers. At the same time, they mark the 
positions of the first encoded data bits in the data register 1182 which follow 
a marker. This synchronous behavior of the marker position bits in the 
marker register 1183 and the data bits in the data register 1182 is used in 
the control block 1185 for detection and removal of padding bits, as well as for 
passing marker positions to the output of the decoder in a way synchronous 
with the decoded data, As mentioned, the two preshifters (data 1172 and 
marker 1173), postshifters (data 1180 and marker 1181) and registers (data 
1182 and marker 1183) get the same control signals which facilitates fully 
parallel and synchronous operation. 

The decoding unit 1184, also shown in Fig. 89 gets the sixteen most 
significant bits of the data register 1182 which are driven to a combinatorial 
decoding unit 1184 for extraction of a decoded Huffman value, the length of 
the present input code being decoded and the length of the additional bits 
following immediately the input code (which is a function of the decoded value). 
The length of the additional bits is known after the corresponding preceding 
Huffman symbol is decoded, so is the starting position of the next Huffman 
symbol. This effectively requires, if speed of one value decoded per clock cycle 
is to be maintained, that decoding of a Huffman value is done in a 
combinatorial block. Preferably, the decoding unit comprizes four PLA style 
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decoding tables hardwired as a combinatorial block taking a 16-bit token on 
input from the data register 1182 and producing a Huffoian value (8 bits), the 
length of the corresponding Huffman-encoded symbol (4 bite) and the length of 
the additional bits (4 bits) as illustrated in Fig. 89. 

Removal of padding bits takes place dining the actual decoding when 
a sequence of padding bits is detected in the data register 1182 by a decoder 
of padding bits which is part of the control unit 1185. The decoder of padding 
bits operates as shown in Fig. 90. Eight most significant bits of the marker 
register 1183, 1242 are monitored for presence of a marker position bit. If a 
marker position bit is detected, all the bits in the data register 1182, 1241 
which correspond to, that is have the same positions as, the bits preceding the 
marker bit in the marker register 1242 are recognized as belonging to a 
current padding zone. The content of the current padding zone is checked by 
the detector of paddingbits 1243 for Vs. If all the bits in the current padding 
zone are l's, they are recognized as padding bits and are removed from the 
data register. Removal is done by means of shifting of the contents of the data 
register 1182, 1241 (and at the same time the marker register 1183, 1242) to 
the left using the respective shifters 1172, 1173, 1180, 1181 in one clock cycle, 
ae in normal decode mode with the difference that no decoded value is 
outputted. E not all the bits in the current padding zone are l's, a normal 
decode cycle is performed rather than a padding bits removal cycle. 
Detection of padding bits takes place each cycle as described, in case there are 
some padding bits in the data register 1182 to be removed. 

The control unit 1185 is shown in detail in Fig. 87. The central part 
of the control unit is the register 1223 holding the current number of valid bits 
in the data register 1182. The number of valid bits in the marker register 
1183 is always equal to the number of valid bits in the data register 1182. 
The control unit preforms three functions. Firstly, it calculates a new number 
of bits in the data register 1182 to be stored in the register 1223. Secondly, 
it determines control signals for the shifters 1172, 1173, 1180, 1181, 1186, 
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1187 decoding unit 1184, and the output formatter 1188. Finally, it detects 
padding bits in the data register 1182, as described above. 

The new number of bits in the data register 1182 (new_ncb) is 
calculated as the current number of bits in the data register 1182 (nob) plus 
the number of bits (nos) available for loading from the stripper 1171 in the 
current cycle, less the number of bits (nor) removed from the data register 
1182 in the current cycle, which is either a deoode cycle or a padding bits 
removal cycle. The new number of bits is calculated as follows: 

new_nob = nob + nos - nor 

The respective arithmetic operations are done in adder 1221 and 
subtractor 1222. It should be noted that (nos) can be 0 if there is no data 
available from the stripper 1171 in the current cycle. Also, (nor) can be 0 if 
there ie no decoding done in the current cycle because of shortage of bite in the 
data register 1182, which means there are less bits in the data register than 
the sum of the current code length and the following additional bits length as 
delivered by the control unit 1185. The value (new_nob) may exceed 64 and 
block 1224 checks for this condition. In such a case, the stripper 1171 is 
stalled and no new data is loaded. Multiplexer 1233 is used for zeroing the 
number of bits to bo loaded from the stripper 1171. A corresponding signal 
for stalling the stripper 1171 is not shown. Signal "padding cycle" driven by 
decoder 1231 controls multiplexer 1234 to select either the number of padding 
bits or the number of decoded bits (that is the length of code bits plus 
additional bits) as number of bits to be removed (nor). If the number of the 
decoded bits is greater than the number (nob) of the bits in the data register, 
which is checked in comparator 1228, the effective number of bits to shift as 
provided for multiplexer 1234 is set to zero by a complex NAND gate 1230. 
As a result, (nor) is set to zero and no bits are removed from the data register. 
The output of multiplexor 1234 is also used to control postshifteis 1182 and 
1183. The width of the data register 1182 must be chosen in a way 
preventing a deadlock situation. This means that at any time either there 
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needs to be room in the data register to accommodate the maximum number 
of bits available from the stripper 1171 or sufficient number of valid bits to be 
removed as a result of a decode or a padding of bits removed cycle. 

Calculation of the number of bits to be removed in a decode cycle is 
performed by adder 1226. Its operands come from the combinatorial 
decoding unit 1184. As the code length of 16 bits is coded as "0000" by the 
decoding unit, "orreduce" logic 1225 provides encoding of "0000" into 
w 10000", 3delding a correct unsigned operand. This operand together with the 
output of subtractor 1227 provide control signals to the output formatting 
shifters 1186 and 1187. 

Block 1229 is used for detection of EOI (End Of Image) marker 
position. The EOI marker itself is removed by the stripper 1171, but there 
can be some padding bits which are the very last bite of the data and which 
used to precede the EO[ marker before its removal in the stripper 1171. The 
comparator 1229 checks if the number of bits in the data register 1182, stored 
in register 1223 is less than eight. If it is, and there is no more data to come 
from the stripper 1171 (that is the data register 1182 holds all the remaining 
bits for of the data unit being decoded), the remaining bits define the size of 
the padding zone before the removed EOI marker. Further handling of the 
padding zone and possible removal of padding bits is identical to the 
procedure applied incase of padding bits before RST markers, which has been 
described before. 

Barrel shifters 1186, 1187 and output formatter 1188 play a support 
role and depending on the embodiment may have a different implementation 
or may not be implemented at all. Control signals to them come from the 
control unit 1185, as described above. The abjreshifter (additional bits 
preshifter) 1186 takes 32 bits from the data register as input and shifts them 
to the left by the length of the Huffman code being presently decoded. In this 
way, all the additional bits following the code being presently decoded appear 
left aligned on the output of the barrel shifter 1186 which is also the input to 
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the barrel shifter 1187. The abjostshifter (additional bits postshifter) 
1187 adjusts the position of the additional bits from left aligned to right 
aligned in an 11 bit field, as used in the output format of the data and shown 
in Fig. 91. The additional bits field extends from bit 8 to bit 18 in the output 
word format 1196 and some of the most significant bits may be invalid, 
depending on the actual number of the additional bits. This number in 
encoded on bits 0 to 3 of 1196, as specified by the JPEG standard. If a 
different format of the output data is adopted, the barrel shifters 1186 and 
1187 and their functionality may change accordingly. 

The output formatter block 1188 packs the decoded values, which in 
JPEG standard are DC and AC coefficients, (1196, bits 0 to 7) and a DC 
coefficient indicator (1196, bit 19) passed by the control unit 1185 together 
with the additional bits (1196, bits 8 to 18) passed by the abjostshifter 
1187 and the marker position bit (1196, bit 23) from the marker register 1183 
into words according to the format presented in Fig. 91. The output 
formatter 1188 also handles any particular requirements as to the output 
interface of the decoder. The implementation of the output formatter is 
normally expected to change if the output interface changes as a result of 
different requirements. 

The foregoing described Huffman decoder provides a highly effective form of 
decoding providing a high speed decoding operation. 
3.17.8 Image Transformation Instructions 

These instructions implement general affine transformations of 
source images. The operation to construct a portion of a transformed image 
falls generally into two broad areas. These include firstly working out which 
parts of the source image are relevant to constructing the current output 
scanline and, if necessary, decompressing them. The second step normally 
comprizes necessary sub -sampling and/or interpolation to construct the 
output image on a pixel by pixel basis. 
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Turning to Fig. 92, there is illustrated a flow chart of the steps 
required 720 to calculate the value of a destination pixel assuming that the 
appropriate sections of the source image have been decompressed. Firstly, 
the relevant sub-sampling, if present, must be taken into account 721. Next, 
two processes are normally implemented, one involving interpolation 722 and 
the other being sub-sampling. Normally interpolation and sub-sampling are 
alternative steps, however in some circumstances interpolation and sub- 
sampling may be used together, In the interpolation process, the first step is 
to find the four surrounding pixels 722, then determine if pre-multiplication is 
required 723, before performing bilinear interpolation 724. The bilinear 
interpolation step 724 is often computationally intensive and limits the 
operation of the image transformation process. The final step in calculating 
a destination pixel value is to add together the possibly bilinear interpolated 
sub-samples from the source image. The added together pixel values can be 
accumulated 727 in different possible ways to produce destination image 
pixels of 728. 

The instruction word encoding for image transformation instructions 
is as illustrated in Fig. 93 with the following interpretation being placed on 
the minor opcode fields. 
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Table 19: Instruction Word - Minor Opcode Fields 



Field 


Description 


S 


0 = bi-linear interpolation is used on the four 
surrounding source image pixels to determine the 
actually sampled valuo 

1 = sampled value is snapped to the closest 
source image pixel value 


off[3:0] 


0 = do not apply the offset register (mdp_por) to 
the corresponding channel 

1 = apply the offset register (mdp_por) to the 
corresponding channel 


P 


0 = do not pre-multiply source image pixels 

1 = pre-multiply source image pixels 


C 


0 = do not clamp output values 

1 = clamp output underflows to 0x00 and 
overflows to OxFF 


A 


0 = do not take absolute value of output values 

1 - take absolute value of output values before 
wrapping or clamping 



The instruction operand and result fields are interpreted as follows: 



Table 20: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format 


Operand A 


kernel descriptor 




short or long kernel 
descriptor table 


Operand B 


Source Image 
Pixels 


other 


image table format 


Operand C 


unused 






Result 


pixels 


pixies 


packed stream, 
unpacked bytes 



Operand A points to a data structure known as a "kernel descriptor" 
that describes all the information required to define the actual 
transformation. This data structure has one of two formats (as defined by 
the L bit in the A descriptor). Fig. 94 illustrates the long form of kernel 
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descriptor coding and Fig. 95 illustrates the short form of encoding. The 
kernel descriptor describes: 

1. Source image start coordinates 730 (unsigned fixed point, 24.24 
resolution). Location (0,0) is at the top left of the image. 

2. Horizontal 731 and vertical 732 (sub-sample) deltas (2's complement 
fixed point, 24.24. resolution) 

3. A 3 bit bp field 733 defining the location of the binary point within the 
fixed point matrix co-efficients as described hereinafter. 

4. Accumulation matrix co-efficients 735 (if present). These are of 
"variable" point resolution of 20 binary places (2's complement), with the 
location of the binary point implicitly specified by the bp field. 

5. An rl field 736 that indicates the remaining number of words in the 
kernel descriptor. This value is equal to the number of rows times the 
number of columns minus 1. 

The kernel co-efficients in the descriptor are listed row by row, with 
elements of alternate rows listed in reverse direction, thereby forming a zig 
zag pattern. 

Turning now to Fig. 96, the operand B consists of a pointer to an index 
table indexing into scan lines of a source image. The structure of the index 
table is as illustrated in Fig. 96, with the operand B 740 pointing to an index 
tabk 741 which in turn points to scan lines (eg. 742) of the required source 
image pixels. Typically, the index table and the source image pixels are 
racheahle and possibly located in the local memory. 

The operand C stores the horizontal and vertical sub-sample rate. 
The horizontal and vertical sub-sample rates are defined by the dimensions of 
the sub-sample weight matrix which are specified if the C descriptor is 
present. The dimensions of the matrix r and c are encoded in the data word 
of the image transformation instruction as illustrated in Fig. 97. 

Channel N of a resultant pixel PIN] is calculated in accordance with 
the following equation: 
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p[n]=(l t offset[n] mdp por :0000)^^ X w r,c sf* +rAx,y + cAy)[n] 

r c 

Internally, the accumulated value is kept to 36 binary places per channeL The 
location of the binary point within this ficldi 9 specified by the BP field. The BP field indicates 
the number of leading bits in the accumulated result to discard. The 36 bit accumulated value 
is treated as a signed 2'b compliment number and is clamped or wrapped as specified. In Fig. 
98, there is illustrated an example of the interpretation of the BP field in co-efficient encoding. 

3.17.9 Convolution Instructions 

Convolutions, as applied to rendering images, involves applying a two 
dimensional convolution kernel to a source image to produce a resultant 
image. Convolving is normally used for such matters as edge sharpening or 
indeed any image filter. Convolutions are implemented by the co-processor 
224 in a similar manner to image transformations with the difference being 
that, in the case of transformations the kernel is translated by the width of 
the kernel for each output pixel, in the case of convolutions, the kernel is 
moved by one source pixel for each output pixel. 

If a source image has values S(x,y) and a n x m convolution kernel has 
values C(x,y), then the nth channel of the convolution H[n] of S and C is given 
by: 

H(x,y)[n] = {Loffset{n].mdp^:<im) + Y, X S(x Hy + /) -C(l,])[n] 

' J 

where i e [0,c] and j € [0,r], 

The interpretation of the offset value, the resolution of intermediate results and the 
interpretation of the bp field are the same as for Image Transformation instructions. 

In Fig. 99, there is illustrated an example of how a convolution kernel 750 is applied 
to a source image 751 to produce a resultant image 752. Source image address generation and 
output pixel calculations are performed in a similar manner to that for image transformation 
instructions. The instruction operands take a similar form to image transformations. In Fig. 
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100, there is illustrated the instruction word encoding for convolution instructions with the 
following interpretation being applied to the various fields. 

Table 21: Instruction Word 



Fiel 

d 


Description 


S 


0 = bi-linear interpolation is used on the four surrounding 
source image pixels to determine the actually sampled 
value 

1 = sampled value is snapped to the closest source image 
pixel value 


c 


0 = do not clamp resultant vector values 

1 = clamp result vector values: underflow to 0x00, 
overflow to OxFF 


p 


0 - do not pre-multiply input pixels 

1 = pre multiply input pixels 


A 


0 = do not take absolute value of output values 

1 = take absolute value of output values before wrapping 
or clamping 


off[3:0] 


0 = do not apply the offset register to this channel 

1 = apply the offset register to this channel 



3.17.10 Matrix Multiplication 

Matrix multiplication is utilized for many things including being 
utilized for color space conversion where anaffine relationship exists between 
two color spaces. Matrix multiplication is defined by the following equation: 



L o_ 



b 0,0 b o,i b 0,2 b 0,3 b 04 
b l,0 b l,l kl.2 b l,3 b l,4 

b<> f\ b>> i b? fy b< 



2,0 b 2,l b 2,2 b 2,3 b 2,4 



L u 3>o 



b 3,l b 3,2 b 3,3 



J 3,4. 



a, 



1 



The matrix multiplication instruction operands and results have the 
following format: 
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Table 22: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format j 


Operand A 


6ource image pixels 


pixels 


packed stream 


Operand B 


matrix co-efficients 


other 


image table 
format 


Operand C 


unused 






Result 


pixels 


pixels 


packed stream, 
unpacked bytes 



The instruction word encoding for matrix multiplication instructions 
as illustrated in Fig. 101 with the following table summarising the minor 
opcode fields. 



Table 23: Instruction Word 



Field 


Description 




0 = do not clamp resultant vector values. 

1 = clamp resultant vector values: underflow to 0x00, 
overflow to OxFF 


P 


0 = do not pre-multiply input pixels 

1 = pre-multiply input pixels 


A 


0 = do not take absolute value of output values 

1 = take absolute value of output values before 
wrapping or clamping 



3.17.11 Halftoning 

The co-processor 224 implements a multi-level dither for halftoning. 
Anything from 2 to 255 is a meaningful number of halftone levels. Data to be 
halftoned can be either bytes (ie. unmeshed or one channel from meshed data) 
or pixels (ie. meshed) as long as the screen is correspondingly meshed or 
unmeshed. Up to four output channels (or four bytes from the same channel) 
can be produced per clock, either packed bits (for bi-level halftoning) or codes 
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(for more than two output levels) which are either packed together in bytes or 
impacted in one code per bye. 

The output half-toned value is calculated using the following formula: 
(p x a - 1) + d)/255 

Where p is the pixel value (0<p<265), 1 is the number of levels 
(2<1<255) and d is the dither matrix value (0<d<254). The operand encoding 
is as follows: 



Table 24: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format 


Operand A 


source image 
pixels 


pixels 


packed stream 




source image 
bytes 


packed bytes, 
unpacked bytes 


packed stream 


Operand B 


dither matrix 
co-efficients 


pixels, packed 
bytes, unpacked 
bytes 


packed stream, 
unpacked bytes 


Operand C 


unused 






Result 


halftone codes 


pixels, packed 
bytes unpacked 
byte 9 


packed stream, 
unpacked bytes 



In the instruction word encoding, the minor op code specifies a number 
of halftone levels. The operand B encoding is for the halftone screen and is 
encoded in the same way as a compositing tile. 
3.17-12 Hierarchial Image Format Decompression 

Hierarchial image format decompression involves several stages. 
These stages include horizontal interpolation, vertical interpolation, Huffman 
decoding and residual merging. Each phase is a separate instruction, In the 
Huffman decoding step, the residual values to be added to the interpolated 
values from the interpolation steps are Huffman coded. Hence, the JPEG 
decoder is utilized for Huffman decoding. 
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In Fig. 102, there is illustrated the process of horizontal interpolatioa 
The output stream 761 consists of twice as much data as the input stream 
762 with the last data value 763 being replicated 764. Fig. 103 illustrates 
horizontal interpolation by a factor of 4. 

In the second phase of hierarchial image format decompression, rows 
of pixels are up sampled by a factor of two or four vertically by linear 
interpolation. During this phase, one row of pixels is on operand A and the 
other row is on operand B. 

When vertically interpolating, either by a factor of two or four, the 
output data stream contains the same number of pixels as each input stream. 
In Fig. 104, there is illustrated an example of vertical interpolation wherein 
two input data streams 770, 771 are utilized to produce a first output stream 
772 having a factor of two interpolation or a second output stream 773 having 
a factor of 4 interpolation. In the case of pixel interpolation, interpolation 
occurs separately on each of the four channels of four channel pixels. 

The residual merging process involves the bytewize addition of two 
streams of data. The first stream (operand A) is a stream of base values and 
the second stream (operand B) is a stream of residual values. 

In Fig. 105, there is illustrated two input streams 780, 781 and a 
corresponding output stream 782 for utilising the process of residual merging. 

In Fig. 106 there is illustrated the instruction word encoding for 
hierarchial image format instructions with the following table providing the 
relevant details of the minor op code fields. 
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Table 25: Instruction Word - Minor Opcode Fields 



1 Field 


Description 


R 


0 = interpolation 

1 = residual merging 


V 


0 = horizontal interpolation 

1 = vertical interpolation 


F 


0 = interpolate by a factor of 2 

1 = interpolate by a factor of 4 


C 


0 = do not clamp resultant values 

1 = clamp resultant values: underflow to 0x00, 
overflow to OxFF 



3.17.13 Memory Copy Instructions 

Those instructions are divided into two specifically disjointed groups, 
a. General purpose data movement instructions 

These instructions utilize the normal data flow path through the co- 
processor 224, comprising the input interface module, input interface switch 
252, pixel organizer 246, JPEG coder 241, result organizer 249 and then thu 
output interface module. In this case, the JPEG coder module sends data 
straight through without applying any operation. 

Other instructions include data manipulation operations including: 

• packing and unpacking sub-byte values (such as bits, two bit 
values and four hit values) to a byte 

• packing and unpacking bytes within a word 

• aligning 

• meshing and unmeshing 

• byte lane swapping and duplicating 

• memory clearing 

• replicating values 

The data manipulation operation is carried out by a combination of 
the pixel organizer (on input) and the result organizer (on output). In many 
cases, these instructions can be combined with other instructions. 
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b. Local DMA instructions 

No data manipulation takes place, As seen in Fig. 2 data transfer 
occurs (in either direction) between the Local Memory 236 and the Peripheral 
Interface 237. These instructions are the only ones for which execution can be 
overlapped with some other instruction. A maximum of one of these 
instructions can execute simultaneously with a "non overlapped" instruction. 

In memory copy instructions, operand A represents the data to be 
copied and the result operand represents the target address of the memory 
copy instructions. For general purpose memory copy instructions, the 
particular data manipulation operation is specified by the operand B for input 
and operand C for output operand words. 
3.17.14 Flow Control Instructions 

The flow control instructions are a family of instructions that provide 
control over various aspect of the instruction execution model as described 
with reference to Fig. 9. The flow control instructions include both 
conditional and unconditional jumps enabling the movement from one virtual 
address to another when executing a stream of instructions. A conditional 
jump instruction is determined by taking a co-proceeeor or register, masking 
off any relevant fields and comparing it to given value. This provides for 
reasonable generality of instructions. Further, flow control instructions 
include wait instructions which are typically used to synchronize between 
overlapped and non-overlapped instructions or as part of micro-programming. 

In Fig. 107, there is illustrated instruction when encoding for flow 
control instructions with the minor opcodes being interpreted as follows: 
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Table 26: Instruction Word - Minor Opcode Fields 



1 Field Description 


type 


00 = jump 
01= wait 


C 


0 = unconditional jump 

1 = condition jump 


c 


0 = use Operand B as Condition Register 
and Operand C as Condition mask 

1 = any interrupt condition set [ 


N 


0 = jump if condition is true 

1 = dont jump if condition is true 


0 


0 = wait on non-overlapped instruction to 
finish 

1 = wait on overlapped instruction to finish 



In respect of Jump Instructions, the operand A word specified the 
target address of the j ump instruction. If the S bit of the Minor Opcode is set 
to 0, then operand B specified a co-processor register to use as the source of 
the condition. The value of the operand B descriptor specifies the address of 
the register, and the value of the operand B word defines a value to compare 
the contents of the register against. The operand C word specifies a bitwize 
mask to apply to the result. That is, the Jump Instruction's condition is true 
of the bitwize operation: 

({(register_vaiue xor Operand B) and Operand C) = 0x00000000) 

Further instructions are also provided for accessing registers for 
providing full control at the micro programmed leveL 
3.18 Modules of the Accelerator Card 

Turning again to Fig. 2, there will now be provided further separate 
description of the various modules. 
3,18,1 Pixel Organizer 

The pixel organizer 246 addresses and buffers data streams from the 
input interface switch 252. The input data is stored in the pixel organizer's 
internal memory or buffered to the MUV buffer 250. Any necessary data 
manipulation is performed upon the input stream before it is delivered to the 
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main data path 242 or JPEG coder 241 as required. The operating modes of 
the pixel organizer are configurable by the usual CBus interface. The pixel 
organizer 246 operates in one of five modes, as specified by a PO_CFG control 
register. These modes include: 

(a.) Idle Mode - where the pixel organizer 246 is not performing any 
operations. 

(b) Sequential Mode - when input data is stored in an internal FIFO and 
the pixel organizer 246 sends out requests for data to the input interface 
switch 252, generating 32 bit addresses for this data. 

(c) Color Space Conversion Mode - when the pixel organizer buffers pixels 
for color space conversion. In addition, requests are made for interval and 
fractional values stored in the MUV buffer 250. 

(d) JPEG Compression Mode - when the pixel organizer 246 utilizes the 
MUV buffer to buffer image data in the form of MClPs. 

(e) Convolution and Image Transformation Mode - when the pixel 
organizer 246 stores matrix co-efficients in the MUV buffer 250 and passes 
them, as necessary, to the main data path 242. 

The MUV buffer 250 is therefore utilized by the pixel organizer 246 
for both main data path 242 and JPEG coder 241 operations. During color 
space conversion, the MUV RAM 250 stores the interval and fractional tables 
and they are accessed as 36 bits of data (four color channels) x(4 hit interval 
values and 8 bit fractional values). For image transformation and 
convolution, the MUV RAM 250 stores matrix co-efficients and related 
configuration data. The co efficient matrix is limited to 16 rows x 16 
columns with each co-efficient being at a maximum 20 bits wide. Only one 
co-efficient per clock cycle is required from the MUV RAM 250. In addition 
to co-efficient data, control information such as binary point, source start 
coordinates and sub-6ample deltas must be passed to the main data path 242. 
This control information is fetched by the pixel organizer 246 before any of the 
matrix co-efficients are fetched 
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During JPEG compression, the MUV buffer 250 is utilized by the pixel 
organizer 246 to double buffer MCU's. Preferably, the technique of double 
buffering is employed to increase the performance of JPEG compression. One 
half of the MUV RAM 250 is written to using data from the input interface 
switch 252 while the other half is read by the pixel organizer to obtain data to 
send to the JPEG coder 241. The pixel organizer 246 is also responsible for 
performing horizontal sub-sampling of color components where required and 
to pad MCU's where an input image does not have a size equal to an exact 
integral number of MCUs. 

The pixel organizer 246 is also responsible for formatting input data 
including byte lane swapping, normalization, byte substitution, byte packing 
and unpacking and replication operations as hereinbefore discussed with 
reference to Fig. 32 of the accompanying drawings. The operations are 
carried out as required by setting the pixel organizers registers. 

Turning now to Fig. 108, there is shown the pixel organizer 246 in 
more detail. The pixel organizer 246 operates under the control of its own set 
of registers contained within a CBus interface controller 801 which is 
interconnected to the instruction controller 235 via the global CBus. The 
pixel organizer 24G includes an operand fetch unit 802 responsible for 
generating requests from the input interface switch 252 for operand data 
needed by the pixel organizer 246. The start address for operand data is 
given by the POJJAID register which must be set immediately before 
execution. The PCLSAID register may also hold immediate data, as 
specified by the L bit in the POJDMR register. The current address pointer 
in stored in the PO_CDP register and is incremented by the burst length of 
any input interface switch request. When data is fetched into the MUV 
RAM 250, the current offset for data is concatenated with a base address for 
the MUV RAM 250 as given by the PL JVfUV register. 

A FIFO 803 is utilized to buffer sequential input data fetched by the 
operand fetch unit 802. The data manipulation unit 804 is responsible for 
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implementing for implementing the variouB manipulations as described with 
reference to Fig. 32. The output of the data manipulation unit is passed to 
the MUV address generator 805 which is responsible for passing data to the 
MUV RAM 250, main data path 242 or JPEG coder 241 in accordance with 
configuration registers. A pixel organizer control unit 806 is a state machine 
that generates the required control signals for all the sub-modules in the pixel 
organizer 246. Included in these signals are those for controlling 
communication on the various Bus interfaces. The pixel organizer control 
unit outputs diagnostic information as required to the miscellaneous module 
239 according to its statue register settings. 

Turning now to Fig. 109, there is illustrated the operand fetch unit 
802 of Fig. 108 in more detail The operand fetch unit 802 includes an 
Instruction Bus address generator (IAG) 810 which contains a state machine 
for generating requests to fetch operand data. These requests are sent to a 
request arbiter 811 which arbitrates between requests from the address 
generator 810 and those from the MUV address generator 805 (Fig. 108) and 
sends the winning requests to the input (MAG) interface switch 252. The 
request arbiter 811 contains a state machine to handle requests. It monitors 
the state of the FIFO via FIFO count unit 814 to decide when it should 
dispatch the next request. A byte enable generator 8 12 takes information on 
the IAG 810 and generates byte enable patterns 816 specifying the valid bytes 
within each operand data word returned by the input interface switch 252. 
The byte enabled pattern is stored along with the associated operand data in 
the FIFO. The request arbiter 811 handles MAG requests before IAG 
requests when both requests arrive at the same time. 

Returning to Fig. 108, the MUV address generator 805 operates in a 
number of different modes. A first of these modes is the JPEG (compression) 
mode. In this mode, input data for JPEG compression is supplied by the 
data manipulation units 804 with the MUV buffer 250 being utilized as a 
double buffer. The MUV RAM 250 address generator 805 is responsible for 
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generating the right addresses to the MUV buffer to store incoming data 
processed by the data manipulation unit 804. The MAG 805 is also 
responsible for generating read addresses to retrieve color component data 
from the stored pixels to form 8x8 blocks for JPEG compression. The MAG 
805 is also responsible for dealing with the situation when a MCU lies 
partially on the image. In Fig, 110, there is illustrated an example of a 
padding operation carried out by the MAG 805. 

For normal pixel data, the MAG 805 stores the four color components 
at the same address within the MUV RAM 250 in four 8 bit rams. To 
facilitate retrieval of data from the same color channel simultaneously, the 
MCU data is barrel shifted to the left before it is stored in the MUV RAM 
250. The number of bytes the data is shifted to the left is determined by the 
lowest two bits of the write address. For example, in Fig. Ill there is 
illustrated the data organization within the MUV RAM 250 for 32 bit pixel 
data when no sub-sampling is needed. Sub-sampling of input data maybe 
selected for three or four channel interleaved JPEG mode. In multichannel 
JPEG compression mode with sub sampling operating, the MAG 805 (Fig. 108) 
performs the sub-sampling before the 32 bit data is stored in the MUV RAM 
250 for optimal JPEG coder performance. For the first four incoming pixels, 
only the first and fourth channels stored in the MUV RAM 250 contains 
useful data. The data in the second and third channel is sub-sampled and 
stored in a register inside the pixel organizer 246. For the next four incoming 
pixels, the second and third channel are filled with sub-sampled data. In Fig, 
112, there is illustrated an example of MCU data organization for multi- 
channel sub-sampling mode. The MAG treats all single channel unpacked 
data exactly the same as multi-channel pixel data. An example of single 
channel packed data as read from the MUV RAM is illustrated in Fig. 113. 

While the writing process is storing an incoming MCU into the MUV 
RAM, the reading process is reading 8x8 blocks out of the MUV RAM. In 
general, the blocks are generated by the MAG 805 by reading the data for each 
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channel sequentially, four co-efficients at the time. For pixel data and 
unpacked input data, the stored data is organized as illustrated in Fig. 111. 
Therefore, to compose one 8 x 8 block of non-sampled pixel data, the reading 
process reads data diagonally from the MUV RAM. An example of this 
process is illustrated in Fig. 114, which shows the reading sequence for four 
channel data, the form of storage in the MUV RAM 250 assisting to read 
multiple values for the same channel simultaneously. 

When operating in color conversion mode, the MUV RAM 250 is used 
as a cache to hold the interval and fractional values and the MAG 805 
operates as a cache controller. The MUV RAM 250 caches values for three 
color channels with each color channel containing 256 pairs of four bit interval 
and fractional values. For each pixel output via the DMU, the MAG 805 is 
utilized to get the values from the MUV RAM 250. Where the value is not 
available, the MAG 805 generates a memory read request to fetch the missing 
interval and fractional values. Instead of fetching one entry in each request, 
multiple entries are fetched simultaneously for better utilization of 
bandwidth. 

For image transformation and convolution, the MUV RAM 250 stores 
the matrix co-efficients for the MDP. The MAG cycles through all the matrix 
co-efficient stored in the MUV RAM 250. At the start of an image 
transformation and convolution instruction, the MAG 805 generates a request 
to the operand fetch unit to fetch the kernal description "header" (Fig. 94) and 
the first matrix co-efficient in a burst request. 

Turning now to Fig. 115, there is illustrated the MUV address 
generator (MAG) 805 of Fig: 108 in more detail. The MAG 805 includes an 
IBus request module 820 which multiplexers BBus requests generated by an 
image transformation controller (TTX) 821 and a color space conversion (CSC) 
controller 822. The requests are sent to the operand fetch unit which services 
the request. The pixel organizer 246 is only operated either in image 
transformation or color space conversion mode. Hence, there is no 
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arbitration required between the two controllers 82 1, 822. The IBus request 
module 820 derives the information for generating a request to the operand 
fetch unit including the burst address and burst length from the relevant pixel 
organizer registers. 

A JPEG controller 824 is utilized when operating in JPEG mode and 
comprizes two state machines being a JPEG write controller and a JPEG read 
controller. The two controllers operate simultaneously and synchronize with 
each other through the use of internal registers. 

In a JPEG compression operation, the DMU outputs the MCU data 
which is stored into the MUV RAM. The JPEG Write Controller is 
responsible for horizontal padding and control of pixel subsampling, while the 
JPEG Read Controller is responsible for vertical padding. Horizontal 
padding is achieved by stalling the DMU output, and vertical padding is 
achieved by reading the previously read 8x8 block line. 

The JPEG Write Controller keeps track of the position of the current 
MCU and DMU output pixel on the source image, and uses this information 
to decide when the DMU has to be stalled for horizontal padding. When a 
MCU has been written into the MUV RAM 250, the JPEG Write Controller 
sets/resets a set of internal registers which indicates the MCU ie on the right 
edge of the image, or is at the bottom edge of the image. The JPEG Read 
Controller then uses the content of these registers to decide if it is required to 
perform vertical padding, and if it has read the last MCU on the imago. 

The JPEG Write Controller keeps track of DMU output data, and 
stores the DMU output data into the MUV RAM 250. 

The controller uses a set of registers to record the current position of 
the input pixel. Thi6 information is used to perform horizontally padding by 
stalling the DMU output. 

When a complete MCU has been written into the MUV RAM 250, the 
controller writes the MCU information into JPEG-RW-IPC registers which is 
later used by the JPEG Read Controller. 
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The controller enters the SLEEP state after the last MCU has been 
written into the MUV RAM 250. The controller stays in this state until the 
current instruction completes. 

The JPEG Read Controller read the 8x8 blocks from the MCUs stored 
in the MUV RAM 250. For multi-channel pixels, the controller reads the 
MCU several times, each time extracting a different byte from each pixel 
stored in the MUV RAM. 

The controller detects if it needs to perform vertical padding using the 
information provided by the JPEG-RW-IPC. Vertical padding is achieved by 
re-reading the last 8-bytes read from the MUV RAM 250. 

The Image Transformation Controller 821 is responsible for reading 
the kernel discriptor from the IBus and passes the kernel header to the MDP 
242, and cycles through the matrix co-efficients as many times as specified in 
the po.len register. All data output by the PO 246 in an image 
transformation and Convolution instruction are fetched directly from the IBus 
and not passed through the DMU. 

The top eight bits of the first matrix co-efficient fetched immediately 
after the kernel header contains the number of remaining matrix co-efficients 
to be fetched. The kernel header is passed to the 

MDP directly without modifications, whilst the matrix coefficients arc sign 
extended before they are passed to the MDP. 

The pixel sub-sampler 825 comprizes two identical channel sub- 
samplers, each operating on a byte from the input word. When the relevant 
configuration register is not asserted, the pixel sub-sampler copies its input to 
its output. When the configuration register is asserted, the sub-sampler sub- 
samples the input data either by taking the average or by decimation. 

An MUV multiplexer module 826 selects the MUV read and write 
signals from the currently active controller. Internal multiplexers are used to 
SGlect the read addresses output via the various controllers that utilize the 
MUV RAM 250. An MUV RAM write address is held in an 8 bit register in 
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an MUV multiplexer module. The controllers utilising the MUV RAM 250, 
load the write address register in addition to providing control for determining 
a next MUV RAM address. 

A MUV valid access module 827 is utilized by the color space 
conversion controller to determine if the interval and fractional values for a 
current pixel output by the data manipulation unit is available in the MUV 
RAM 250. When one or more color channels are missing, the MUV valid 
access module 827 passes the relevant address to the IBua request module 
820 for loading in burst mode, interval and fractional values. Upon servicing 
a cache miss, the MUV valid access module 827 sets internal validity bits 
which map the set of interval and fractional values fetched so far. 

A replicate module 829 replicates the incoming data, the number of 
times as specified by an internal pixel register. The input stream is stalled 
while the replication module is replicating the current input word. A PBus 
interface module 630 is utilized to re-time the output signals of the pixel 
organizer 246 to the main data path 242 and JPEG coder 241 and vice versa. 
Finally, a MAG controller 831 generates signals for initiating and shutting 
down the various sub-modules. It also performs multiplexing of incoming 
PBus signals from the main data path 242 and JPEG coder 241. 
3.18.2 MUV Buffer 

Returning to Fig. 2, it will be evident from the foregoing discussion 
that the pixel organizer 246 interacts with the MUV buffer 250. 

The reconfigurable MUV buffer 250 is able to support a number of 
operating modes including the single lookup table mode (modeO), multiple 
lookup table mode (model), and JPEG mode (mode2). A different type of 
data object is stored in the buffer in each mode. For instance, the data 
objects that are stored in the buffer can be data words, values of a multiplicity 
of lookup tables, single channel data and multiple channel pixel data. In 
general, the data objects can have different sizes. Furthermore, the data 
objects stored in the reconfigurable MUV buffer 250 can be accessed in 
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substantially different ways which is dependent on the operating mode of the 
buffer. 

To facilitate the different methods needed to store and retrieve 
different types of data objects, the data objects are often encoded before they 
are stored. The coding scheme applied to a data object is determined by the 
size of the data object, the format that the data objects are to be presented, 
how the data objects are retrieved from the buffer, and also the organization 
of the memory modules that comprize the buffer. 

Fig. 116 is a block diagram of the components used to implement the 
reconfigurable MUV buffer 250. The reconfigurable MUV buffer 250 
comprizes an encoder 1290, a storage device 1293. a decoder 1291, and a read 
address and rotate signal generator 1292. When a data object arrives from 
an input data stream 1295, the data object may be encoded into an internal 
data format and placed on the encoded input data stream 1296 by the encoder 
1290. The encoded data object is stored in the storage device 1293, 

When decoding previously stored data objects, an encoded data object 
is read out of the storage device via encoded output data stream 1297. The 
encoded data object in the encoded output data stream 1297 is decoded by a 
decoder 1291. The decoded data object is then presented at the output data 
stream 1298, 

The write addresses 1305 to the storage device 1293 are provided by 
the MAG 805 (Fig 108). The read addresses 1299, 1300 and 1301 are also 
provided by the MAG 805 (Fig 108), and translated and multiplexed to the 
storage device 1293 by the Read Address and Rotate Signal Generator 1292, 
which also generates input and output rotate control signals 1303 and 1304 to 
the encoder and decoder respectively. The write enable signals 1306 and 
1307 are provided by an external source. An operating mode signal 1302 > 
which is provided by means of the controller 801(Fig 108), is connected to the 
encoder 1290, the decoder 1291, the Read Address and Rotate Signal 
Generator 1292, and the storage device 1293, An increment signal 1308 
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increments internal counters) in the read address and rotate signal generator 
and may be utilized in JPEG mode (mode2). 

Preferably, when the reconfigurable MUV buffer 250 is operating in 
the single lookup table mode (modeO), the buffer behaves substantially like a 
single memory module. Data objects may be stored into and retrieved from 
the buffer in substantially the same way used to access memory modules. 

When the reconfigurable MUV buffer 250 is operating in the multiple 
lookup table mode (mode 1), the buffer 250 is divided into a plurality of tables 
with up to three lookup tables may be stored in the storage device 1293. The 
lookup tables may be accessed separately and simultaneously. For instance, 
in one example, interval and fraction values are stored in the storage device 
1293 in the multiple lookup table mode, and the tables are indexed utilizing 
the lower three bytes of the input data stream 1295. Each of the three bytes 
are issued to access a separate lookup table stored in the storage device 1293. 

When an image undergoes JPEG compression, the image is converted 
into an encoded data stream. The pixels are retrieved in the form of MCUs 
from the original image. The MCUs are read from left to right, and top to 
bottom from the image. Each MCU is decomposed into a number of single 
component 8x8 blocks. The number of 8x8 blocks that can be extracted from 
a MCU depends on several factors including; the number of color components 
in the source pixels, and for a multiple channel JPEG mode, whether 
subsampling is needed. The 8x8 blocks are then subjected to forward DCT 
(FDCT), quantization, and entropy encoding. In the case of JPEG 
decompression, the encoded data are read sequentially from a data stream. 
The data stream undergoes entropy decoding, dequantization and inverse 
DCT (IDCT). The output of the IDCT operation are 8x8 blocks. A number of 
single component 8x8 blocks are combined to reconstruct a MCU, As with 
JPEG compression, the number of single component 8x8 blocks are dependent 
on the same factors mentioned above. The reconfigurable MUV buffer 250 
may be used in the process to decompose MCUs into a multiplicity of single 
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component 8x8 blocks, to reconstruct MCUa from a multiplicity of single 
component 8x8 blocks. 

When the reconfigurable MUV buffer 250 is operating in JPEG mode 
(mode2), the input data stream 1295 to the buffer 250 comprizes pixels for a 
JPEG compression operation, or single component data in a JPEG 
decompression operation. The output data stream 1298 of the buffer 250 
comprizes single channel data blocks for a JPEG compression operation, or 
pixel data in a JPEG decompression operation. In this example, for a JPEG 
compression operation, an input pixel may comprize up to four channels 
denoted Y, U, V and O. When the required number of pixels have been 
accumulated in the buffer to form a complete pixel block, the extraction of 
single component data blocks can commence. Each single component data 
block comprizes data from the like channel of each pixel stored in ihe buffer. 
Thus in this example, up to four single component data blocks may be 
extracted from one pixel data block. In this embodiment, when the 
reconfigurable MUV buffer 250 is operating in the JPEG mode (mode2) for 
JPEG compression, a multiplicity of Minimum Coded Units (MCUs) each 
containing 64 single or 64 multiple channel pixels may he stored in the buffer, 
and a multiplicity of 64-byte long single channel component data blocks are 
extracted from each MCU stored in the buffer. In this embodiment, for the 
buffer 1289 operating in the JPEG mode (raode2) for a JPEG decompression 
operations, the output data stream contains output pixels that have up to four 
components Y, U, V and O. When the required number of complete single 
component data blocks have been written into the buffer, the extraction of 
pixel data may commence. A byte from up to four single component block 
corresponding to different color components are retrieved to form an output 
pixel. 

Fig, 117 illustrates the encoder 1290 of Fig. 116 in more detail. For 
the pixel block decomposition mode only, each input data object is encoded 
using a byte-wize rotation before it is stored into the storage device 1293 (Fig, 
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129), The amount of rotation is specified by the input rotate control signal 
1303. As the pixel data has a maximum of four bytes in this example, a 32- 
bit 4-to-l multiplexer 1320 and output 1325 is used to select one of the four 
possible rotated versions of the input pixel. For example, if the four bytes in 
a pixel are labelled (3,2,1,0), the four possible rotated versions of this pixel 
are (3,2,1,0), (0,3,2,1), (1,0,3 ; 2) and (2,1,0.3). The four encoded bytes are 
output 1296 for storage in the storage device. 

When the buffer is placed in an operating mode other than the JPEG 
mode (mode2), for example, single lookup table mode (modeO) and multiple 
lookup table mode (model), byte-wize rotation may not be necessary and may 
not be performed on the input data objects. The input data object is 
prevented from being rotated in the latter cases by overriding the input rotate 
control signal with a no-operation value. This value 1323 can be zero. A 2-to- 
1 multiplexer 1321 produces control signals 1326 by selecting between the 
input rotate control signal 1303 and the no-operation value 1323. The 
current operating mode 1302 is compared with the value assigned to the pixel 
block decomposition mode to produce the multiplexer select signal 1322. The 
4-to-l multiplexer 1320, which is controlled by signal 1326 selects one of the 
four rotated version of the input data object on the input data stream 1325, 
and produces an encoded input data object on the encoded input data stream 
1320. 

Fig. 118 illustrates a schematic of a combinatorial circuit which 
implements the decoder 1291 for the decoding of the encoded output data 
stream 1297. The decoder 1321 operates in a substantially similar manner 
to the encoder. The decoder only operates on the data when the data buffer is 
in the JPEG mode (modo2). The lower 32-bit of an encoded output data 
object in the encoded output data stream 1297 is passed to the decoder. The 
data is decoded using a byte-wize rotation with an opposite sense of rotation 
to the rotation performed by the encoder 1290. A 32-bit 4-to-l multiplexer 
1330 is uBed to select one of the four possible rotated version of the encoded 
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data. Fox example, if the four bytes in an input pixel are labelled (3,2,1,0), 
the four possible rotated version of this pixel are (3,2,1,0), (2,1,0,3), (1,0,3,2) 
and (0,3,2,1). The output rotate control signal 1304 is utilized only when the 
buffer is in a pixel block decomposition mode, and when overridden by a no- 
operation value in other operating modes. The no-operation value utilized 
1333 is zero. A 2-to-l multiplexer 1331 produces signal 1334 by selecting 
selects between the output rotate control signal 1304 and the no-operation 
value 1333. The current operating mode 1302 is compared with the value 
assigned to the pixel block decomposition mode to produce the multiplexer 
select signal 1332. The 4-to-l multiplexer 1330, which is controlled by signal 
1334, selects one of the four rotated version of the encoded output data object 
on the encoded output data stream 1297, and produces an output data object 
on the output data stream 1298. 

Returning to Fig. 116, the method of internal read address generation 
used by the circuit is selected by the operating mode 1302 of the 
reconfigurable MUV buffer 250. For the single lookup table mode (modeO) 
and multiple lookup table mode (model), the read addresses are provided by 
the MAG 805 (Fig 108) in the form of external read addresses 1299, 1300, and 
1301. For the single lookup table mode (modeO), the memory modules 1380, 
1381, 1382, 1383, 1384 and 1385 (Fig. 121) of the storage device 1293 operate 
together. The read address and the write address supplied to the memory 
modules 1380 to 1385 (Fig. 121) are substantially the same. Hence the 
storage device 1293 only needs the external circuits to supply one read 
address and one write address, and uses internal logic to multiplex these 
addresses to the memory modules 1380 to 1385 (Fig. 121). For modeO, the 
read address is supplied by the external read address 1299 (Fig. 116) and is 
multiplexed to the internal read address 1348 (Fig. 121) without substantial 
changes. The external read addresses 1300 and 1301 (Fig. 116), and the 
internal read addresses 1349, 1350 and 1351 (Fig. 121), are not used in 
modeO. The write address is supplied by the external write address 1305 
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(Fig. 116), and is connected to the write address of each memory module 1380 
to 1385 (Fig. 121) without substantial modification. 

In this example, a design that provides three lookup tables in the 
multiple lookup table mode (mode 1) is presented. The encoded input data is 
written simultaneously into all memory modules 1380 to 1385 (Fig. 121), 
while the three tables are accessed independently, and thus require one index 
to each of the three tables. Three indices, that is, read addresses to the 
memory modules 1380 to 1385 (Fig. 121), are supplied to the storage device 
1293. These read addresses are multiplexed to the appropriate memory 
modules 1380 to 1385 using internal logic. In substantially the same 
manner as in the single lookup table mode, the write address supplied 
externally is connected to the write address of each of the memory modules 
1380 to 1385 without substantial modifications. Hence, for the multiple 
lookup table mode (mode 1), the external read addresses 1299, 1300 and 1311 
are multiplexed to internal read addresses 1348, 1349 and 1350 respectively. 
The internal road address 1351 is not used in mode L Tho method of 
generating the internal read addresses need in the JPEG mode (mode 2) is 
different to the method described above. 

Fig. 119 illustrates a schematic of a combinatorial circuit which 
implements the read address and rotate control signals generation circuit 
1292 (Fig. 116), for the reumfigurable data buffer operating in the JPEG mode 
(mode 2) for JPEG compression. In the JPEG mode (mode 2), the generator 
1292 uses the output of a component block counter 1340 and the output of a 
data byte counter 1341 to compute the internal read addresses to the memory 
modules comprising the storage device 1293. The component block counter 
1340 gives the number of component blocks extracted from a pixel data block, 
which is stored in the storage device. The number of like components extracted 
from the pixel data block is given by multiplying the output of the data byte 
counter 1341 by four. In this embodiment, an internal read address 1348, 
1349, 1350 or 1351 for the pixel data block decomposition mode is computed 
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as follows. The output of the component block counter is used to generate an 
offset value 1343, 1344, 1345, 1346 or 1347, and the output of the data byte 
counter 1341 is UBed to generate a base read address 1354. The offset value 
1343 is added 1358 to the base read address 1354 and the sum is an internal 
read address 1348 (or 1349, 1350 or 1351). The offset values for the memory 
modules are in general different for simultaneous read operations performed 
on multiple memory modules, but the offset value to each memory module is 
in general substantially the same during the extraction of one component data 
block. The base addresses 1354 used to compute the four internal read 
addresses in the pixel data block decomposition mode are substantially the 
same. The increment signal 1308 is used as the component byte counter 
increment signal. The counter is incremented after every successful read 
operation has been performed. A component block counter increment signal 
1356 is used to increment the component block counter 1340, after a complete 
single component data block has been retrieved from the buffer. 

The output rotate control signal 1304 (Fig. 116) is derived from the 
output of the component block counter, and the output of the data byte counter, 
in substantially similar manner to the generation of an internal road n ddress. 
The output of the component block counter is used to compute a rotation offset 
1347, The output rotate control signal 1304 is given by the lowest two hits of 
the sum of the base read address 1354 and the rotation offset 1355. The 
input rotate control signal 1303 is simply given by the lowest two bytes of the 
external write addresses 1305 in this example of the address and rotate 
control signals generator. 

Fig. 120 shows another example of the address generator 1292 for 
reassembling multiple channel pixel data from single component data stored 
in the reconfigurable MUV buffer 250. In this case, the buffer is operating in 
the JPEG (mode2) for JPEG decompression operation. In this case, single 
component data blocks are stored in the buffer, and pixel data blocks are 
retrieved from the buffer. In this example, the write address to the memory 
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modules are provided by the external write address 1305 without substantial 
changes. The single component blocks are stored in contiguous memory 
locations. The input rotate control signal 1303 in this example is simply set 
to the lowest two bits of the write address. A pixel counter 1360 is used to 
keep track of the number of pixels extracted from the single component blocks 
stored in the buffer. The output of the pixel counter is used to generate the 
read addresses 1348, 1349, 1350 and 1351, and the output rotate control 
signal 1304. The read addresses are in general different for each memory 
module that comprize the storage device 1293. In this example, a read 
address comprizes two parts, a single component block index 1362, 1363, 
1364 or 1365, and a byte index 1361. An offset is added to bit 3 and 4 of the 
output of the pixel counter to calculate the single component block index for a 
particular block The offsets 1366, 1367, 1368 and 1369 are in general 
different for each read address. Bit 2 to bit 0 of the output of the pixel 
counter are used as the byte index 1361 of a read address. A read address is 
the result of the concatenation of a single component block index 1362, 1363, 
1364 or 1365 and a byte index 1361, as illustrated in Fig. 120. In this 
example, the output rotate control signal 1304 is generated using bit 4 and bit 
3 of the output of the pixel counter without substantial change. The 
increment signal 1308 is used as the pixel counter increment signal to 
increment the pixel counter 1360. The pixel counter 1360 is incremented 
after a pixel has been successfully retrieved from the buffer. 

Fig. 121 illustrates an example of a structure of the storage device 
1293. The storage device 1293 can comprize three 4-bit wide memory 
modules 1383, 1384 and 1385, and three 8-bit wide memory modules 1380, 
1381 and 1382. The memory modules can be combined together to store 36- 
bit words in the single lookup table mode (modeO), 3x1 2-bit words in the 
multiple lookup table mode (model), and 32-bit pixels or 4x8-bit single 
component data in JPEG mode (mode2). Typically each memory module is 
associated with a different part of the encoded input and output data streams 
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(1296 and 1297). For example, memory module 1380 has its data input port 
connected to bit 0 to bit 7 of the encoded input data stream 1296, and its data 
output port connected to bit 0 to bit 7 of the encoded output data stream 1297. 
In this example, the write addresses to all the memory modules are connected 
together, and share substantially the same value. In contrast, the read 
addresses 1386, 1387, 1388, 1389,1390 and 1391 to the memory modules of 
the example illustrated in Fig. 121 are supplied by the read address generator 
1292, and are in general different. In the example, a common write enable 
signal is used to provide the write enable signals to all three 8-bit memory 
modules, and a second common write enable signal is used to provide the 
write enable signals to all three 4-bit memory modules. 

Fig, 122 illustrates a schematic of a combinatorial circuit used for 
generating read addresses 1386, 1387, 1388, 1389, 1390 and 1391 for 
accessing to the memory modules contained in a storage device 1293. Each 
encoded input data object is broken up into parte, and each part is stored into 
a separate memoiy module in the storage devioe. Hence, typically the write 
addresses to all memory modules for all operating modes are substantially 
the same and thus substantially no logic is required to compute the write 
address to the memory modules. The read addresses in this example, on the 
other hand, are typically different for different operations, and are also 
different to each memory module within each operating mode. All bytes in 
the output data 6tream 1298 of the reconfigurable MUV buffer 250 must 
contain single component data extracted from the pixel data stored in the 
buffer in the JPEG mode (mode2) for JPEG compression, or pixel data 
extracted from the single component data blocks stored in the buffer in the 
JPEG mode for JPEG decomposition The requirements on the output data 
stream are achieved by providing four read addresses 1348, 1349, 1350 and 
1351 to the buffer. In the multiple lookup table mode (model), up to three 
lookup tables are stored in the buffer, and thus only up to three read 
addresses 1348, 1349 and 1350 are needed to index the three lookup tables. 
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The read addresses to all memory modules are substantially the same in the 
single lookup table mode (modeO), and only read address 248 is used in this 
mode. The example controller circuit shown in Fig. 122 uses the operating 
mode signals to the buffer, and up to four read addresses, to compute the read 
address 1386 - 1391 to each of the six memory modules comprising the 
storage device 1293. The read address generator 1292 takes, as its inputs, 
the external read addresses 1299, which comprizes external address buses 
1348, 1349, 1350 and 1351, and generates the internal read addresses 1386, 
1387, 1388, 1389, 1390 and 1391 to the memory modules that comprize the 
storage device 1293. No manipulation on the external write addresses 1305 
is required in the operation of this example. 

Fig. 123 illustrates a representation of an example of how 20-bit 
matrix coefficients may be stored in the buffer 250 when the buffer 250 is 
operating in single lookup table mode (modeO). In this exa mple, typically no 
encoding is applied on the data objects stored in the cache when the data 
objects are written into the reconfigurahle MUV buffer. The matrix 
coefficients are stored in the 8-bit memory modules 1380, 1381 and 1382. 
Bit 7 to bit 0 of the matrix coefficient are stored in memory module 1380, bit 
15 to bit 8 of the matrix coefficient are stored in memory module 1381, and bit 
19 to bit 16 of the matrix coefficient arc stored in the lower 4 bits of memory 
module 1382. The data objects 6tored in the buffer may be retrieved as 
many times as required for the rest of the instruction. The write and read 
addresses to all memory modules involved in the single lookup table mode are 
substantially the same. 

Fig. 124 illustrates a representation of how the table entries are 
stored in the buffer in the multiple lookup table mode (model). Tn this 
example, up to three lookup tables may be stored in the buffer, and each 
lookup table entry comprizes a 4-bit intorval value and an 8-bit fraction valuo. 
Typically the interval values arc stored in the 4-bit memory modules, and the 
fraction values are stored in the 8-bit memory modules. The three lookup 
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tables 1410, 1411 and 1412 are stored in the memory banks 1380 and 1383, 
1381 and 1384, 1382 and 1385 in the example. The separate write enable 
control signals 1306 and 1307 (Fig. 121) allow the interval values to be 
written into the storage device 1293 without affecting the fraction values 
already stored in the storage device. In substantially the same manner, the 
fraction values may be written into storage device without affecting the 
interval values already stored in the storage device. 

Fig. 125 illustrates a representation of how pixel data is stored in the 
reconfigurable MUV buffer 25D when the JPEG mode <mode2) for 
docompoeing pixel data blocks into single component data blocks. The 
storage device 1293 is organized as four 8-bit memory banks, which comprizes 
the memory modules 1380, 1381, 1382, 1383 and 1384, with 1383 and 1384 
used together to operate substantially in the same manner as an 8-bit 
memory module. Memory module 1385 is not used in the JPEG mode 
(mode2). A 32-bit encoded pixel is broken up into four bytes, and each is 
stored into a different 8-bit memory module. 

Fig. 126 illustrates a representation of how the single component data 
blocks are stored in the storage device 1293 in single component mode. The 
storage device 1293 is organized as four 8-bit memory banks, which comprizes 
the memory modules 1380, 1381, 1382, 1383 and 1384, with 1383 and 1384 
used together to operate substantially in the same manner as an 8-bit 
memory module. A single component block in thie example comprizes 64 
bytes. A different amount of byte rotation can be applied to each single 
component block when it is written into the buffer, A 32-bit encoded pixel 
dnta is retrieved by reading from the different single component data block 
stored in the buffer. 

For further details on the organization of the data within the MUV 
buffer 250 reference is made herein to the section entitled Pixel Organizer. 

This preferred embodiment has shown that a reconfigurable data 
buffer may be used to handle data involved in different instructions. A 
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reconfigurable data buffer that provides three operating modes has been 
disclosed. Different address generation techniques may be needed in each 
operating mode of the buffer. The single look-up table mode (niodeO) may be 
used to store matrix coefficients in the buffer for an image transformation 
operation. The multiple look-up table mode (model) may be used to store a 
multiplicity of interval and fraction lookup tables in the buffer in a multiple 
channel color space conversion (CSC) operation. The JPEG mode (mode2) 
may be used either to decompose MCU data into single component 8x8 blocks, 
or to reconstruct MCU data from single-component 8x8 blocks, in JPEG 
compression and decompression operation respectively. 
3.18.3 Result Organizer 

The MUV buffer 250 is also utilized by the result organizer 249. The 
result organizer 249 buffers and formats the data stream from either the 
main data path 242 or the JPEG coder 241. The result organizer 249 also is 
responsible for data packing and unpacking, denormalization, byte lane 
swapping and realignment of result data as previously discussed with 
reference to Fig. 42. Additionally the result organizer 249 transmits its 
results to the external interface controller 238, the local memory controller 
23G, and the peripheral interface controller 237 as required. 

When operating in JPEG decompression mode, the results organizer 
249 utilizes the MUV RAM 250 to double buffer image data produced by the 
JPEG coder 241. Double buffering increases the performance of the JPEG 
decompression by allowing data from the JPEG coder 241 to be written to one 
half of the MUV RAM 250 while at the same time image data presently in 
the other half of the MUV RAM 250 is output to a desired destination. 

The 1, 3 and 4 channel image data is passed to the result organizer 
249 during JPEG decompression in a form of 8 x 8 blocks with each block 
consisting of 8 bit components from the same channel. The result organizer 
stores these blocks in the MUV RAM 250 in the order provided and then, for 
multi-channel interleaved images, meshing of the channels in performed when 
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reading data from the MUV RAM 250. For example, in a three channel 
JPEG compression based on Y, U, V color space, the JPEG coder 241 outputs 
three 8x8 blocks, the first consisting of Y components, the second made of the 
U components and the third made tip of the V components. Meshing is 
accomplished by taking one component from each block and constructing the 
pixel in the form of (YUVX) where X represents an unused channel. Byte 
swapping may be applied to each output to swap the channels as desired. 
The result organizer 249 must also do any required sub-sampling to 
reconstruct chroma-data from decompressed output. This can involve 
replicating each program channel to produce and an one. 

Turning to Fig. 127, there is illustrated the result organizer 249 of Fig. 
2 in more detail. The result organizer 249 is based around the usual 
standard CBus interface 840 which includes a register file of registers to be 
set for operation of the result organizer 249. The operation of the result 
organizer 249 is similar to that of the pixel organizer 246, however the reverse 
data manipulation operations take place. A data manipulation unit 842 
performs byte lane swapping, component substitution, component deselection 
and denormalization operations on data provided by the MUV address 
generator (MAG) 805. The operations carried out are those previously 
described with reference to Fig. 42 and operate in accordance with various 
fields set in internal registers. The FIFO queue 843 provides buffering of 
output data before it is output via EBus control unit 844. 

The RButi control unit 844 is composed of an address decoder and 
state machines for address generation. The address for the destination 
module is stored in an internal register in addition to data on the number of 
output bytes required. Further, an internal RCLCUT register specifies how 
many output bytes to discard before sending a byte stream on the output bus. 
Additionally, a RO_LMT register specifies the maximum number of data 
items to bo output with subsequent data bytes after the output limit being 
ignored. The MAG 805 generates addresses for the MUV RAM 250 during 
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JPEG decompression. The MUV RAM 250 is utilized to double buffer 
output from the JPEG decoder. The MAG 805 performs any appropriate 
meshing of components in the MUV RAM 250 in accordance with an internal 
configuration register and outputs single channel, three channel or four 
channel interleaved pixels. The data obtained from the MUV RAM 250 ie 
then passed through the data manipulation unit 842, since byte lane 
swapping may need to be applied before pixel data is sent to the appropriate 
destination. When the results organizer 249 is not configured for JPEG mode, 
the MAG 805 simply forwards data from the PBub receiver 845 straight 
through to the data manipulation unit 842. 
3.18.4 Operand Organizers B and C 

Returning again to Fig. 2, the two identical operand organizers 247, 
248 perform the function of buffering data from the data cache control 240 and 
forwarding the data to the JPEG coder 241 or the main data path 242. The 
operand organizers 247, 248 are operated in a number of modes: 

(a) Tdle mode wherein the operand organizer only responds to CBus 
requests. 

(b) Immediate mode when the data of the current instruction is 
stored in an internal register of the operand organizer. 

(c) Sequential mode wherein the operator organizer generates 
sequential addresses and requests data from the data cache controller 240 
whenever its input buffer requires filling. 

A number of modes of operation of the main data path 242 require at 
least one of the operand organizers 247, 248 to operate in sequential mode. 
These modes include compositing wherein operand organizer B 247 is 
required to buffer pixels which are to be composited with another image. 
Operand organizer C 248 is used for compositing operations for attenuation of 
values for each data channel. La halftoning mode, operand organizer B 247 
buffers 8 bit matrix co-efficients and in hierarchial image format 
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decompression mode the operand organizer B 247 buffers data for both 
vertical interpolation and residual merging instructions. 

(d) In constant mode, an operand organizer B constructs a single 
internal data word and replicates this word a number of times as given by an 
internal register. 

(e) In tiling mode an operand organizer B buffers data that 
comprizes a pixel tile. 

(f) In random mode the operand organizer forwards addresses from 
the MDP 242 or JPEG coder 241 directly to the data cache controller. These 
addresses are utilized to index the data cache 230. 

An internal length register specifies the number of items to be 
generated by individual operand organizers 247, 248 when operated in 
sequent ial/titling/constant mode. Each operand organizer 247, 248 keeps 
account of the number of data items processed so far and stops when the count 
reaches the value specified in its internal register. Each operand organizer is 
further responsible for formatting input data via byte lane swapping, 
component substitution., packed/unpacked and normalization functions. The 
desired operations are configured utilising internal registers. Further, each 
operand organizer 247, 248 may also bo configured to constrict data items. 

Turning now to Fig. 128, there is illustrated the structure of operand 
organizers (247, 248) in more detail The operand organiser 247, 248 
includes the usual standard CBus interface and registers 8G0 responsible for 
the overall control of the operand organizer. Further, an OBus control unit 
851 is provided for connection to the data cache controller 240 and is 
responsible for performing address generation for sequential/tile/ constant 
modes, generating control signals to enable communications on the OBus 
interface to each operand organizer 247, 248 and controlling data 
manipulation unit operations such as normalization and replication, that 
require the state to be saved from previous clock cycles of the input stream. 
When an operand organizer 247, 248 is operating in sequential or tiling mode, 
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the OBus control unit 851 sends requests for data to the data cache controller 
240, the addresses being determined by internal registers. 

Each operand organizer further contains a 36 bit wide FIFO buffer 852 
used to buffer data from the data cache controller 240 in various modes of 
operation. 

A data manipulation unit 853 performs the same functions as the 
corresponding data manipulation unit 804 of th9 pixel organizer 246. 

A main data path/JPEG coder interface 854 multiplexer address and 
data to and from the main data path and JPEG coder modules 242, 241 in 
normal operating mode. The MDP/JC interface 854 passes input data from 
the data manipulation units 853 to the main data path and in the process 
may be configured to replicate this data. When operating in color conversion 
mode, the units 851, 854 are bypassed in order to ensure high speed access to 
the data cache controller 240 and the color conversion tables. 
3.18.6 Main Data Path Unit 

The aspects of the following embodiment relate to an image processor 
providing a low cost computer architecture capable of performing a number of 
image processing operations at high speed. Still further, the image processor 
seeks to provide a flexible computer architecture capable of being configured 
to perform image processing operations that are not originally specified. The 
image processor also seeks to provide a computer architecture having a large 
amount of identical logic, which simplifies the design process and lowers the 
cost of designing such an architecture. 

The computer architecture comprises a control register block, a 
decoding block, a data object processor, and flow control logic. The control 
register block stores ail the relevant information about the image processing 
operation. The decoding block decodes the information into configuration 
signals, which configure an input data object interlace. The input data object 
interface accepts and stores data objects from outside, and distributes these 
data objects to the data object processor. For some image processing 
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operations, the input data object interfaoe may also generate addresses for 
data objects, so that the source of these data objects can provide the correct 
data objects. The data object processor performs arithmetic operations on 
the data objects received. The flow control logic controls the flow of data 
objects within the data object processing logic. 

More particularly, the data object processor can comprise a number of 
identical data object sub-processor, each of which processes part of an 
incoming data object. The data object sub-processor includes a number of 
identical multifunctional arithmetic units that perform arithmetic operations 
on these parts of data objects, post processing logic that processes the 
outgoing data objects, and multiplexer logic that connects the multifunctional 
arithmetic units and the post-processing unit together. The multifunctional 
arithmetic units contain storage for parts of the calculated data objects. The 
storage is enabled or disabled by the flow control logic. The multifunctional 
arithmetic units and multiplexer logic are configured by the configuration 
signals generated by the decoding logic. 

Furthermore, the configuration signals from the decoding logic can be 
overridden by an external programming agent. Through this mechanism any 
multifunctional blocks and multiplexer logic can be individually configured by 
an external programming agent, allowing it to configure the image processor 
to perform image processing operations that are not specified beforehand. 
These and other aspects of the embodiments of the invention are described in 
greater detail hereinafter. 
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Returning to Fig. 2, as noted previously the main data path unit 242 
performs all data manipulation operations and instructions other than JPEG 
data coding. These instructions include compositing, color space conversion, 
image transformations, convolution, matrix multiplication, halftoning, 
memoiy copying and hierarchial image format decompression. The main 
data path 242 receives pixel and operand data from the pixel organizer 246, 
and operand organizers 247, 248 and feeds the resultant output to the result 
organizer 249. 

Fig.129 illustrates a block diagram of the main data path unit 242, 
The main data path unit 242 is a general image processor and includes input 
interface 1460, image data processor 1462, instruction word register 1464, 
instruction word decoder 1468, control signal register 1470, register file 1472, 
and a ROM 1475. 

The instruction controller 235 transfers instruction words to the 
instruction word register 1464 via bus 1454. Each instruction word contains 
information such as the kind of image processing operation to be executed, 
and flags to enable or disable various options in that image processing 
operation. The instruction word is then transferred to the instruction word 
decoder 1468 via bus 1465. Instruction controller 235 can then indicate to 
the instruction word decoder 1468 to decode the instruction word. Upon 
receiving that indication, the instruction decoder 1468 decodes the instruction 
word into control signals. These control signals are then transferred via bus 
1469 to the control signal register 1470. The output of the control signal 
register is then connected to the input interface 1460 and image data 
processor 1462 via bus 1471. 

To add further flexibility to the main data path unit 242, the 
instruction controller 235 can also write into the control signal register 1470. 
This allows anyone who is familiar with the structure of the main data path 
unit 242 to micro -configure the main data path unit 242 so that the main 
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data path unit 242 will execute image processing operations that are not be 
described by any instruction word. 

In cases when all the necessary information to perform the desired 
image processing operation does not fit into the instruction word, the 
instruction controller 235 can write all the other information necessary to 
perform tho desired image processing operation into some of the selected 
registers in register file 1472. The information is then transferred to the 
input interface 1460 and the image data processor 1462 via bus 1473. For 
some image processing operations, the input interface 1460 may update tho 
contents of selected registers in the register file 1472 to reflect the current 
status of the main data path unit 242. This feature helps the instruction 
controller 235 to find out what the problem is when there is a problem in 
executing an image processing operation. 

Once the decoding of the instruction word is finished, and/or the 
control signal register is loaded with the desired control signals, the 
instruction controller 235 can indicate to the main data path unit 242 to start 
performing the desired image processing operation. Once that indication is 
received, the input interface 1460 begins to accept data objects coming from 
bus 1451, Depending on the kind of image processing operation performed, 
the input interface 1460 may also begins to accept operand data coming from 
operand bus 1452 and/or operand bus 1453, or generates addresses for 
operand data and receive operand data from operand bus 1452 and/or operand 
bus 1453. The input interface 1460 then stores and rearranges the incoming 
data in accordance with the output of the control signal register 1470. The 
input interface 1460 also generates coordinates to be fetched via buses 1452 
and 1453 when calculating such functions as affine image transformation 
operations and convolution. 

The image data processor 1462 performs the major arithmetic 
operations on the rearranged data objects from the input interface 1460. The 
image processor 1462 can: interpolate between two data objects with a 
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provided interpolation factor; multiply two data objects and divide the 
product by 255; multiply and add two data objects in general; round off 
fraction pares of a data object which may have various resolutions; clamp 
overflow of a data object to some maximum value and underflow of a data 
object to some minimum value; and perform scaling and clamping on a data 
object. The control signals on bus 1471 control which of the above arithmetic 
operations are performed on the data objects, and the order of the operations. 

A ROM 1475 contains the dividends of 255/x, where xis from 0 to 255, 
rounded in 8.8 format. The ROM 1475 is connected to the input interface 1460 
and the image data processor 1462 via bus 1476, The ROM 1475 is used to 
generate blends of short lengths and multiply one data object by 255 and 
dividing the product by another data object. 

Preferably, the number of operand buses eg 1452 is limited to 2, which 
is sufficient for most image processing operations. 

Fig. 130 illustrates the input interface 1460 in further detail. Input 
interface 1460 includes data object interface unit 1480, operand interface 
units 1482 and 1484, address generation state machine I486, blend 
generation state machine 1488, matrix multiplication state machine 1490, 
interpolation state machine 1490, data synchronizer 1500, arithmetic unit 
1496, miscellaneous register 1498, and data distribution logic 1505. 

Data object interface unit 1480 and operand interface units 1482 and 
1484 are responsible to receive data objects and operands from outbids. 
These interface units 1482, 1484 are all configured by control signals from 
control bus 1515. These interface units 1482, 1484 have data registers 
within t hem to contain the data objects/operands that they have just received, 
and they all produce a VALE) signal which is asserted when the data within 
the data register is valid. The outputs of the data registers in these interface 
units 1482, 1484 are connected to data bus 1505. The VALID signals of 
these interface units 1482, 1484 are connected to flow bus 1510. When 
configured Lo fetch operands, operand interface units 1482 and 1484 accept 
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addressee from arithmetic unit 1496, matrix multiplication state machine 
1490 and/or the output of data register in data object interface unit 1480, and 
select amongst them the required address in accordance with the control 
signals from control bus 1515. In some cases, the data registers in operand 
interface units 1482 and 1484 can be configured to store data from the output 
of data register in data object interface unit 1480 or arithmetic unit 1496. 
especially when they are not needed to accept and store data from outside. 

Address generation state machine 1486 is responsible for controlling 
arithmetic unit 1496 so that it calculates the next coordinates to be accessed 
in the source image in affme image transformation operations and convolution 
operations. 

The address generation state machine 1486 waits for START signal 
oncontrolbus 1615 to beset. When the START signal on control bus 1615 is 
set, address generation state machine i486 then de-asserts the STALL signal 
to data object interface unit 1480, and waits for data objects to arrive. It also 
sets a counter to be the number of data objects in a kernel descriptor that 
address generation state machine 1486 needs to fetch. The output of the 
counter is decoded to become enable signals for data registers in operand 
interface units 1482 and 1484 and miscellaneous register 1498. When the 
VALID signal from data object interface unit 1480 is asserted, address 
generation state machine 1486 decrements the counter, so the next piece of 
data object is latched into a different register. 

When the counter reaches zero, address generation state machine 
1486 tells operand interface unit 1482 to start fetching index table values and 
pixels from operand interface unit 1484, Also, it loads two counters, one with 
the number of rows, another with the number of columns. At every clock edge, 
when it is not paused by STALL signals from the operand interface unit 1482 
or others, the counters are decremented to give the remaining rows and 
columns, and the arithmetic unit 1496 calculates the next coordinates to be 
fetched from. When both counters have reached zero, the counters reload 
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themselves with the number of rows and columns again, and arithmetic unit 
1496 is configured to find the top left hand corner of the next matrix. 

If interpolation is used to determine the true value of a pixel, address 
generation state machine 1486 decrements the number of rows and columns 
after every second clnck cycle. This is implemented using a 1-bit counter, 
with the output used as the enable of the row and column counter. After the 
matrix is traversed around once, the state machine sends a signal to 
decrement the count in the length counter. When the counter reaches 1, and 
the final index table address is sent to the operand interface unit 1482, the 
state machine asserts a final signal, and resets the start bit. 

Blend generation state machine 1488 is responsible for controlling 
arithmetic unit 1496 to generate a sequence of numbers from 0 to 255 for the 
length of a blend. This sequence of numbers is then used as the interpolation 
factor to interpolate between the blend start value and blend end value. 

Blend generation state machine 1488 determines which mode it 
should run in (jump mode or step mode). If the blend length is less than or 
equal to 256, then jump mode is used, otherwize step mode is used. 

The blend generation state machine 1488 calculates the following and 
puts them in registers (regO, regl, reg2). If a blend ramp is in step mode for a 
predetermined length, then latch 511-length in regO (24 bits), 512 - 2*length 
in reg 1 (24 bits), and end-start in reg 2 (4x9 bits). If the ramp is in jump 
mode, then latch 0 into regO, 255/(length-l) into regl, and end-start into reg2 
(4x9 bits). 

In step mode, the following operations are performed for every cycle: 
If regO > 0, then add regO with reg 1 and store the result in regO. 
Another incrementor can also be enabled so its output is incremented by 1 . If 
regO <= 0, then add regO with 510 and store the result in regO. Incrementor is 
not incremented. The output of the incrementor is the ramp value. 
In jump mode, the following is done for every cycle: 
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Add regO with regl. The Adder output is 24 bits, in fixed point format of 16.8. 
Store the adder output in regO. If the first bit of fraction result is 1, then 
increment the integer part. 

The least 8 bits of the integer part of the incrementor is the ramp 
value. The ramp value, the output of reg2, and the blond stnrt value is then 
fed into the image data processor 1462 to produce the ramp. 

Matrix multiplication state machine 1490 is responsible for 
performing linear color space conversion on input data objects using a 
conversion matrix. The conversion matrix is of the dimension 4X5. The first 
four columns multiply with the 4 channels in the data object, while the last 
column contains constant coefficients to be added to the sum of products. 
When the START signal from control bus 1515 is asserted, matrix 
multiplication state machine does the following: 

1) It generates line numbers jto fetch constant coefficients of the 
conversion matrix from buses 1482 and 1484. It also enables miscellaneous 
register 1498 to store these constant coefficients. 

2) It contains a 1-bit flipflop, which generates a line number 
which is used as an address to fetch half of matrix from buses 1482 and 1484. 
It also generates a w MAT_SEL n signal that selects which half of the data 
object to be multiplied with that half of matrix. 

3) It finishes when there is no data objects coming from data 
object interface unit 1480. 

Interpolation state machine 1494 * is responsible for performing 
horizontal interpolation of data objects. During horizontal interpolation, 
main data path unit 242 accepts a stream of data objects from bus 1451, and 
interpolates between adjacent data objects to output a stream of data objects 
which is twice or 4 times as long as the original stream. Since the data 
objects can be packed bytes or pixels, interpolation state machine 1494 
operates differently in each case to maximize the throughput. Interpolation 
state machine 1494 does the following: 
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1) It generates INT_SEL signal to data distribution logic 1503 to 
rearrange the incoming data objects so that the right pair of data objects are 
interpolated. ■ 

2) It generates interpolation factors to interpolate between 
adjacent pairs of data objects. 

3) It generates a STALL signal to stop data object interface unit 
1480 from accepting more data objects. This is necessary as the output 
stream is longer than the input stream. The STALL signal goes to flow bus 
1510. 

Arithmetic unit 1496 contains circuitry for performing arithmetic 
calculations. It is configured by control signals on control bus 1515. It is 
used by two instructions only: affine image transformation and convolution, 
and blend generation in compositing. 

In affine image transformation and convolution, arithmetic unit 1496 
is responsible for: 

1) Calculating the next x and y coordinates. To calculate x 
coordinates arithmetic unit 1496 uses an adder/subtractor to add/subtract the 
x part of horizontal and vertical delta to/from the current x coordinate. To 
calculate the y coordinates arithmetic unit 1498 uses an adder/subtractor to 
add/subtract the y part of the horizontal or vertical delta to/from the current y 
coordinate. 

2) Adding the y coordinate to the index table offset to calculate 
the index table address. This sum is also incremented by 4 to find the next 
index table entry, when interpolation is used to find true value of a pixel. 

3) Adding the x coordinate to the index table entry to find the 
address of the pixel. 

4) Subtract 1 from the length count. 

In blend generation, arithmetic unit 1496 does the following: 
1) Tn step mode, one of the ramp adders is used to calculate an 
internal variable in the ramp generation algorithm, while the other adder is 
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used to increment the ramp value when the internal variable is greater than 
0. 

2) In jump mode, only one of the adders is required to add the 
jump value to the current ramp value. 

3) Round off fractions occur in jump mode. 

4) Subtract start of blend from end of blend at the beginning of 
ramp generation. 

5) Subtract one from the length count. 

Miscellaneous register 1498 provides extra storage space apart from 
the data registers in data object interface unit 1480 and operand interface 
units 1482 and 1484. It is usually used to store internal variables or as a 
buffer of past data objects fern data object interface unit 1480. It is 
configured by control signals on control bus 1515. 

Data synchronizer 1500 is configured by control signals on control bus 
1515. It provides STALL signals to data object interface unit 1480 and 
operand interface units 1482 and 1484 so that if one of the interface units 
receives a piece of data object others have not, that interface unit is stalLed 
until all the other interface units have received their pieces of data. 

Data distribution logic 1505 rearranges data objects from data bus 
1510 and register file 1472 via bus 1530 in accordance with control signals on 
control bus 1515, including a MATJ3EL signal from matrix multiplication 
state machine 1490 and a INT_SEL signal from interpolation state machine 
1494. The rearranged data is outputed onto bus 1461. 

Fig. 131 illustrates image data processor 1462 of Fig. 129 in further 
detail. Image data processor 1462 includes a pipeline controller 1540, and a 
number of color channel processors 1545, 1550, 1555 and 1560, All color 
channel processors accept inputs from bus 1565, which is driven by the input 
interface 1460 (Fig. 131). All color channel processors and pipeline controller 
1540 are configured by control signals from control signal register 1470 via 
bus 1472. All the color channel processors also accept inputs from register 
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file 1472 and ROM 1475 of Fig. 129 via bus 1580. The outputs of all the 
color channel processors and pipeline controller are grouped together to form 
bus 1570, which forms the output 1455 of image data processor 1462. 

Pipeline controller 1540 controls the flow of data objects within all the 
color channel processors by enabling and disabling registers within ail the 
color channel processors. Within pipeline controller 1540 there is a pipeline 
of registers. The shape and depth of the pipeline is configured by the control 
signals from bus 1471, and the pipeline in pipeline controller 1540 has the 
same shape as the pipeline in the color channel processors. The Pipeline 
controller accepts VALID signals from bus 1565. For each pipeline stage 
within pipeline controller 1540, if the incoming VALID signal is asserted and 
the pipeline stage is not stalled, then the pipeline stage asserts the register 
enable signals to all color channel processors, and latch the incoming VALID 
signal. The output of the latch then a VALID signal going to the next 
pipeline stage. In this way the movement of data objects in thg pipeline ie 
simulated and controlled, without storage of any data. 

Color channel processors 1546, 1550, 1555 and 1560 perform the 
main arithmetic operations on incoming data objects, with each of them 
responsible for one of the channels of the output data object. In the preferred 
embodiment the number of color channel processors is limited to 4, since most 
pixel data objects have a maximum of 4 channels. 

One of the color channel processors processes the opacity channel of a 
pixeL There is additional circuitry (not shown in Fig. 131), connected to the 
control bus 1471, which transforms the control signals from the control bus 
1471 so that the color channel processor processes the opacity channel 
correctly, as for some image processing operations thG operations on the 
opacity channel is slightly different from the operations on the color channels. 

Fig. 132 illustrates color channel processor 1545, 1550, 1555 or 1560 
(generally denoted by 1600 in Fig. 132) in further detail. Each color channel 
processor 1545, 1550, 1555 or 1560 includes processing block A 1610, 



-455- 



164- 19 



processing block B 1615, big adder 1620, fraction rounder 1625, clamp-or- 
wrapper 1630, and output multiplexer 1635. The color channel processor 
1600 accepts control signals from control signal register 1470 via bus 1602. 
enable signals from pipeline controller 1540 via bus 1604, information from 
register file 1472 via bus 1605, data objects from other color channel 
processor via bus 1603, and data objects from input interface 1460 via bus 
160 L 

Processing block A 1610 performs some arithmetic operations on the 
data objects from bus 1601, and produces partially computed data objects on 
bus 1611. The following illustrates what processing block A 1610 does for 
designated image processing operations. 

In compositing, processing block A 1610 pre-multiplies data objects 
from data object bus 1451 with opacity, interpolates between a blend start 
value and a blond end value with an interpolation factor from input interface 
1460 in Fig. 129, pre-multipliee operands from operand bus 1452 in Fig. 129 
or multiplies blend color by opacity, and attenuates multiplication on pre- 
multiplied operand or blend color data. 

in general color space conversion, the processing block A 1610 
interpolates between 4 color table values using two fraction values from bus 
1451 in Fig. 129. 

In afiine image transformation and convolution, the processing block A 
1610 pre-multiplies the color of the source pixel by opacity, and interpolates 
between pixels on the same row using the fraction part of current x-coordinate. 

In linear color space conversion, the processing block A 1610 pre- 
multiplies color of the source pixel by opacity, and multiplies p re-multiplied 
color data with conversion matrix coefficients. 

In horizontal interpolation and vertical interpolation, the processing 
block A 1610 interpolates between two data objects. 

In residual merging, the processing block A 1610 adds two data 

objects. 
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Processing block A 1610 includes a number of multifunction blocks 
1640 and processing block A glue logic 1645. The multifunction blocks 1640 
are configured by control signals, and may perform any one of the following 
functions: 

add/subtract two data objects; 

passing one data object; 

interpolate between two data objects with a interpolation factor; 
pre-multiply a color with an opacity; 

multiply two data objects, and then add a third data object to the 
product; and 

add/subtract two data objects, and then pre-multiply the 
sum/difference with an opacity. 

The registers within the multifunction blocks 1640 are enabled or 
disabled by enable signals from bus 1604 generated by pipelined controller 
1540 in Fig. 131. Processing block A glue logic 1645 accepts data objects 
from bus 1601 and data objects from bus 1603, and the outputs uf some of the 
multifunction blocks 1640, and routes them to inputs of other selected 
multifunction blocks 1640. Processing block A glue logic 1645 is also 
configured by control signals from bus 1602. 

Processing block B 1615 performs arithmetic operations on the data 
objects from bus 1601, and partially computed data objects from bus 1611, to 
produce partially computed data objects on bus 1616. The following . 
description illustrates what processing block B 1615 does for designated 
image processing operations. 

In compositing (with non-plus operators), the processing block B 1615 
multiplies pre-processed data objects from data object bus 1451 and 
operands from operand bus 1452 with compositing ^multiplicands from bus 
1603, and multiplies clamped/wrapped data objects by output of the ROM, 
which is 255/opacity in 8.8 format. 
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In compositing with plus operator, the processing block B 1616 adds 
two pre-processed data objects, hi the opacity channel, it also subtracts 255 
from the sum, multiplies an offset with the difference, and divides the product 
by 255. 

Li genera] color space conversion, the processing block B 1615 
interpolates between 4 color table values using 2 of the fraction values from 
bus 1451, and interpolates between partially interpolated color value from 
processing block A 1610 and the result of the previous interpolation using the 
remaining fraction value. 

In affine image transformation and convolution, the processing block B 
1615 interpolates between partially interpolated pixels using the fraction 
part of current y-coordinate, and multiplies interpolated pixels with 
coefficients in a sub-sample weight matrix. 

In linear color space conversion, the processing block B 1615 pre- 
multiplies the color of the source pixel by opacity, and multiplies pre- 
multiplied color with conversion matrix coefficients. 

Processing block B 1615 again includes a number of multifunction 
blocks and processing block B glue logic 1650. The multifunction blocks are 
exactly the same as those in processing block A 1610, but the processing block 
B glue logic 1650 accepts data objects from buses 1601, 1603, 1611, 1631 and 
the outputs of selected multifunction blocks and routes them to the inputs of 
selected multifunction blocks. Processing block B glue logic 1650 is also 
configured by control signals from bus 1602. 

Big adder 1620 iB responsible for combining some of the partial 
results from processing block A 1610 and processing block B 1615. It accepts 
inputs from input interface 1460 via bus 1601, processing block A 1610 via 
bus 1611, processing block B 1615 via bus 1616, and register file 1472 via bus 
1605, and it produces the combined result on bus 1621. It is also configured 
by control signals on bus 1602. 
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For various image processing operations, big adder 1620 may be 
configured differently. The following description illustrates its operation 
during designated image processing operations. 

In compositing with non-plus operators, the big adder 1620 adds two 
partial products from processing block B 1615 together. 

In compositing with plus operator, the big adder 1620 subtract the 
sum of pre-processed data objects with offset from the opacity channel, if an 
offset enable is on. 

In affine image transformation/convolution, the big adder 1620 
accumulates the products from processing block B 1615 

In linear color space conversion, in the first cycle, the big adder adds 
the two matrix coefficients/data object products and the constant coefficient 
together. In the second cycle, it adds the sum of last qycle with another two 
matrix coefficients/data object products together. 

Fraction rounder 1625 accepts input from the big adder 1620 via bus 
1G21 and rounds off the fraction part of the output. The number of bits 
representing the fraction part is described by a BP signal on bus 1605 from 
register file 1472. The following table shows how the BP signal is 
interpreted. The rounded output is provided on bus 1626. 



Table 27: Fraction Table 



bp field 


Meaning 


0 


Bottom 26 bits are fractions. 


1 


Bottom 24 bite are fractions. 


2 


Bottom 22 bits are fractions. 


3 


Bottom 20 bits are fractions. 


4 


Bottom 18 bits are fractions. 


5 


Bottom 16 hits are fractions. 


6 


Bottom 14 bits are fractions. 


7 


Bottom 12 bits are fractions. j 



As well as rounding off fraction, fraction rounder 1625 also does two 

things; 
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1) determines whether the rouncted result is negative; and 

2) determines whether the absolute value of the rounded result is 
greater than 255. 

Clamp-or-wrapper 1G30 accepts inputs from fraction rounder 1625 via 
bus 1626 and does the following in the order described: 

fmde the absolute value of the rounded result, if such option is 
enabled; and 

clamps any underflow uf the data object to the minimum value of the 
data object, and any overflow of the data object to the maximum value of the 
data object. 

Output multiplexer 1635 selects the final output from the output of 
processing block B on bus 1616 and the output of clamp-or-wrapper on bu6 
1631. It also performs some final processing on the data object. The 
following description illustrates its operation for designated image processing 
operations. 

In compositing with non-plus operators and un-pre-multiplication, the 
multiplexer 1635 combines some of the outputs of processing block B 1615 to 
form the un-pre-multiplied data object. 

In compositing with non-plus operator and no un-pre-multiplication, 
the multiplexer 1635 passes on the output of clamp-or-wrapper 1630. 

In compositing with plus operator, the multiplexer 1635 combines 
some of the outputs of processing block B ,1630 to form resultant data object. 

In general color space conversion, the multiplexer 1635 applies the 
translate-and-clamp function on the output data object. 

In other operations, the multiplexer 1635 passes on the output of 
clamp-or-wrapper 1630. 

Fig. 133 illustrates a single multifunction block (e.g. 1640) in further 
detail. Multifunction block 1640 includes mode detector 1710, two addition 
operand logic unite 1660 and 1670, 3 multiplexing logic units 1680, 1685 and 
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1690, a 2-input adder 1675, a 2-input multiplier with 2 addends 1695, and 
register 1705. 

Mode detector 1710 accepts one input from control signal register 
1470 ; in Fig. 129 the MODE signal 1711, and two inputs from input interface 
1460, in Fig. 129 SUB signal 1712 and SWAP signal 1713. Mode detector 
1710 decodes these signals into control signals going to addition operand logic 
units 1660 and 1670 ; and multiplexing logic units 1680, 1685 and 1690, and 
these control signals configure multifunction block 1640 to perform various 
operations. There are 8 modes in multifunction block 1640: 

1) Add/sub mode: adds or subtract input 1655 to/from input 1665, 
in accordance with the SUB signal 1712. Also, the inputs can be swapped in 
accordance with the SWAP signal 693. 

2) Bypass mode: bypass input 1655 to output. 

3) Interpolate mode: interpolates between inputs 1655 and 1665 
using input 1675 aa the interpolation factor. Inputs 1655 and 1665 can be 
swapped in accordance with the SWAP signal 1713. 

4) Pre-multiply mode: multiplies input 1655 with input 1675 and 
divide it by 255. The output of the INC register 1708 tells the next stage 
whether to increment the result of this stage in bus 1707 to obtain the correct 
result. 

5) Multiply mode: multiplies input 1655 with 1675. 

6) Add/sub tract-and-p re-multiply mode: adds/subtracts input 1665 
to/from input 1655, multiplies the sum/difference with input 1675, and then- 
divide the product by 255. The output of the INC register 1708 tells the next 
stage whether to increment the result of this stage in bus 1707 to obtain the 
correct result. 

Addition operand logic units 1660 and 1670 find one's complement of 
the input on demand, so that the adder can do subtraction as well. Adder 
1675 adds the outputs of addition operand logic 1660 and 1670 in buses 1662 
and 1672 together, and outputs the sum in bus 1677, 
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Multiplexing logic 1680, 1685 and 1690 select suitable multiplicands 
and addends to implement, a desired function. They are all configured by 
control signals on bus 1714 from mode detector 1710. 

Multiplier with two addends 1695 multiplies input from bus 1677 
with input from bus 1682, then adds the products to the sum of inputs fern 
buses 1687 and 1692. 

Adder 1700 adds the least significant 8 bits of the output of 
multiplier 1695 with the most significant 8 bits of the output of multiplier 
1695. The carrynut of adder 1700 is latched in INC register 1701. INC 
register 1701 is enabled by signal 1702, Register 1705 stores the product 
from multiplier 1695. It is also enabled by signal 1702. 

Fig. 134 illustrates a block diagram for the compositing operations. 
The compositing operation accepts three input streams of data: 

1) The accumulated pixel data, which is derived from the same 
location as the result is stored to in this accumulator model. 

2) A compositing operand - which consists of color and opacity, The 
color and opacity can both be either flat, a blend, pixels or tiled. 

3) Attenuation - which attenuates the operand data. The 
attenuation can be flat, a bit map or a byte map. 

Pixel data typically consists of four channels. Three of these 
channels make up the color of the pixel. The remaining channel is the opacity 
of the pbcel. Pixel data can be pre-multiplied or normal. When pixel data is 
pre-multiplied, each of the color channels are multiplied with the opacity. 
Since equations for compositing operators are simple with pre-multiplied 
pixels, usually pixel data is pre-multiplied before it is composited with 
another pixel. 

The compositing operators implemented in the preferred 
embodiments are shown in Table 1. Each operator works on pre-multiplied 
data, (a^, a 0 ) refers to a pre-multiplied pixel of color a c and opacity a^ r is 
the "offset" value and wcQ is the wrappingfelamping operator the reverse 
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operator of each of the over, in, out, atop operators in Table 1 is also 
implemented, and the compositing model has the accumulator on the left. 

Composite block 1760 in Fig. 134 comprizes three color sub-blocks 
and a opacity sub-block. Each color sub-block operates on one color channel, 
and opacity channel of the input pixels to obtain the color of the output pixel. 
The following pseudo code shows how this is done. 

PIXEL Composite IN colorA, colorB: PIXEL; 

IN opacityA, opacityB: PIXETj; 

IN oomp_op: 

COMPOSITE J3PERATOK 

) 

( 

PIXEL result; 

IF cump_up is rover, rin, rout, ratop THEN 

swap colorA and colorB; 

swap opacityA and opacityB; 
END IF; 

IF comp-op is over or rover or loado or plus THEN 
X=1; 

ELSE IF comp_op is in or rin or atop or ratop THEN 

X - opacityB; 
ELSE IF comp-op its out or rout or xor THEN 

X = not(opacityB); 
ELSE IF comp-op is loadzero or loadc or loadco THEN 

X = 0 
END IF; 

IF comp-op is ovor or rover or atop or ratop or xor THEN 

Y= not(opacitya); 
ELSE IF comp_op is plus or loadc or loadco THEN 
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Y= not (opacity a); 
EI5E IF comp_op is plus or loadc or loadco THEN 
Y=l; 

ELSE IF comp-op is in or rin or out or rout or 

loadzero or loado THEN 

Y=0 
END EF; 

result = colorA * X + colorB * Y; 
RETURN result; 

The above pseudo code ia different for the opacity sub-block, since the 
operators loade* and loado' have different meaning in the opacity channel. 

Block 1765 in Fig. 134 is responsible for clamping or wrapping the 
output of block 1760. When block 1765 is configured to clamp, it forces all 
values less than the miniinum allowable value lo the minimum allowed value, 
and all values more than the maximum allowed value to the maximum 
allowed value. If block 1765 is configured to wrap, it calculates the following 
equation: 

((x-min) mod (max-min)) + mia, 

whereby min and max are the minimum and maximum allowed value 
of the color respectively. Preferably the minimum value for a color is 0, and 
the maximum value is 265. 

Block 1770 in Fig. 134 is responsible for un-pre-multiplying the result 
from block 1765. It un-pre-multiplies a pixel by multiplying the pro- 
multiplied color value with 255/o, where o is the opacity after composition. 
The value 255/o is obtained from a ROM inside the compositing engine. Tbe 
value stored in the ROM is in the format of 8.8 and the rest of the fraction is 
rounded. The result of multiplication is stored in the format of 16.8, The 
result would be rounded to 8 bits to produce the un-p re -multiplied pixel. 



-464- 



164 - 28 



Blend generator 3721 <™„ , * 



1) ramp generation, an d 

2) interpolation 



In ramp generation the com • 

-V-n'toao, oTT"^* m " mU ' 3 «— * 

- «~ ttan o r e, M1 t0 26s , ^ * f"* **. whan the leneth 

«<7 ^ PMiod . "* <— — t, at m „ 8l „ n8 

"jy^ til© 

" ^J^^in^r ;r r the rom to cnd - 

^accumulator. The output of th, " SValU6is ^^dedtoa 16- 

- * r seq _. * a bits to form 

aWhm simi i ar ' the Siting engine uses an 

Mowing pseudocode . ^ ShMdra - n ^^.a 8 d Cficribednyt , e 

VoidlinedrawC length: INTE GER 



d = 6M- length; 
incrE = 510; 

^NB = 512-2*length; 
ramp - 0; 

f or(i=0;i(length ; i-H-) 
{ 

tfd(=0then 

d -*•= uicrE; 

else { 
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d += iacrNE; 
ramp++ ; 

} 

> 



20 



After tte fe|bw . ng e?ua 
&»n. the ramp, calculated to generate the blend 

The, BlMd = <<Md - Ste '»'™ 5 „ start 
Ihe <t TO s,on by 255 ^ m 
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» ~n a. edBM Qf t|K ^ * « tri W 
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255 if(192(x) 
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^ pixel by pixeI . ^° latl ° n t0 the output 
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12 IT r""" «««•«. ta. aninput inde, 

able „„ md pMs ^ iaa8e l8ifl ^ adtess P »te 

ng. Id8. The kernel descriptor describes: 

1) Soume image start ^ 
resoluhon). Lection (0,0) is at ihe top left „ the imaBe 

2) Horizontal and vertical sub-sample deltas^'* ~, , 

point, 24.24 resolution). ( complement fixed 

" fixed V SbitbPSeMdefi " i,18theIO0ati ™ Ofthebi ^^^n th e 

lTd :t oo - d ~* The - d - - *. bP 

neld is shown in Fig. 150. 

4) Accummation matrix cedents. These are of "variable" point 
« point implicitly specified by the bp field. 

5) anrl field that indicates the remaining number of words in the 
^ deeenptor. This val „e „ e qU a! to the number o f rows time, 
number of columns minus 1. 

For the short kernel descriptor, apart from the integer par. of start x 
-rduiate, the other parameters are assumed to have the Motving values 
starting x coordinate fraction <-0, 
starting y coordinate <-0, 
horizontal delta <-l.o, 
vertical delta<-1.0. 

After add rcee ^ 1M1 „ ft 

--hisint.di.ferentways.depend^outhedimeusio: 
ft eubsamp,ema.Hx. ^he dimensions of the subeamp,e matrixaM , x 
1, Senear «, adds the hoi^l delta to the cmreu, coordinates 

until it has generated enough coordinates. 



-468- 



1B4-32 



it d. dune*,,™ of a, sub3ampIe ma(ijx are 

— «1 add, *e HorW. deIta „ fc _ M MMdj 

. lidre5S ^ 1831 subtects fte 

After that addrese IgJ1 adds ^ 

and the PTOadllre is ' °™* 

. ^ of the ;:: - * « 

current , and v " ^ e,8te ' 8aWre4,te « , » i »»«l» 
«— » and 7 ooerdtnatee „ fc 
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wl PK<|1 flDm "» source image for 2 if 
K« are intonated, h C aee waea BMra pise , 8 are J^. 

I! 1 - ~ 1831 -* * - ne. indoles 

and two more pixels are obtained. 
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matrix tor the preyious output pixel. 
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The middle diagrams in Pig. 139 ii Wates ^ 
^-multiplier 1832 multiplies the rolor channels ^ thfi op 
channel of the pixel if required. 

labiate XB32 interpolate, between source pixeh „ fi „ d fc ^ 
c >0 of the prra, quired. B^pi^ft^^^ 

^fT T,,enitin ~ 8b ~ P^uaingths fraction 

Part of the current , coordinate andput* the result in a register. After ^ 
« "bta lns the two pixels on the next ^ fam ^ ^ ^ • 
Then rt inflow between those two pi*e le ming the same , 
After ihat^nterpolator 183S uses the faction part of the c TOlt , 

mterpolate between this inte.po.aW tee* and the last interpolated 
refauix. 

Accumulator 1834 does two t hings: 

1) it multiplies the matrix coefficients with the pixel, and 

2) i, accumujates the product abo TC until the whole matrix is 
traversed. Then it outputs a value to the next stage. 

Preferably the accumulator 18S4 can be initialized with 0 or a special 
value on a channel-by-channel basis. 

Block 1835 rounds the output „f 18M< lhen damp8 

underflow, or « m ^ ^ tf 

and finds the absolute value o, .ho output if re ,ui re d. The looatjon rf ' 
btnarvpoint within the output of the accumulate™ specific hv the bp field 
in the brne. descriptor. The bp field indicates the number of leading bits 
» the accumulated result to discard. This is shown in the bottom diagram of 
Fig. 139. Note that the accounted value is treated as a Big ne d two's 
complement number. 

Yet another image procsssmgoperation that the main data path unit 
242 can perform is mat* multiplication. Matrix Multip licati on is used for 
color space conversion where an affine rslationship exists between the two 
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"** l e ' the mafc nx coefficients on line- fli^i, 

added togcthor « nd 8torei . ron6tdnt tetm » -«* « then 

rou-dod 1855 to 8 bits. " , ' m ° f ,he adder » 'hen 
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multiplier eo th., «. ■ * rearrange the outputs of the 

Apart ton, ^ ima6e • 
Path „„ it 242 M * Mnfiplred „ ^ otheropeMtiona ' — 

The ta^, d68cripti(jii fi ^ arcbitBctTO 

capable of performine variolic i m . ""lecture that u 

the Oes lgn nainlr eoatprt^ . nilmber of ^ m J" * 

' tteaw, effort ia reduced aignificntij,. » Blocks, the 

3.18.6Data Cache Controller and Cache 

:r wire: 

tnao . cacne memory is commonly refers 

to as a cache-line. The carl™ m o reierred 

cache-hnee. * ~ S ' ■** ~*« - «* 

The data cache controller 240 services a»t- a 

operand organs 247 248 It first IT ^ ^ ^ 

. It first checks to see if the data is resident in 
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coh. 230. U not, data «, he fetcb8d tom ^ 

1! Z ron T r240to operateina — — 

-odea. There a re als „ spBcial aldlessjng modes where , he J 

. ^^^^^^^ T j «2 

aUo involve suppWng „ „ eigh , WMds (266 ^ J - 

oWKM8 247,248si m »ltaacoualy. 

H. cache RAM is oreamMd „ „ memMy 

mto 256 Ma. ^ arrlng!ment a]so aaows ^ (o ^ 
teaar. 1 ceda lmul ta J ,eo„ s l J .i„ lBVCOIIlelh)mdifrereiitbaiifcs 

cache epe rates in the Mtowing moaee , wUch wiu be aiBm> 
-» y .ate, PreferaWy , it is ^ (o , - » 

is cache if this is desired. " entire 

1. Normal Mode 

2- Single Output General Color Space Conversion Mode 

3- Multiple Output General Color Space Conversion Mode 
4. JPEG Encoding Mode 

0 5. Slow JPEG Decoding Mode 

6. Matrix Multiplication Mode 

7. Disabled Mode 

8. Invalidate Mode 

B. .4, shows the address, data and con tro , fl ow of the data cache 
centred 240 and data eaehe 230 shown in fgnna 2 

The data cache 230, eonsiau of a direct ntapped cache of the type 
~» diseased. The data cache „r 240, „ mitts of 
memo,y 1872 havingatag entry for each cache JiM , which tege,,^ / 
>* P art of the e*e,na, n,e m o iy add ress that the eajllle 

- — * -apped to. Th ere „ al80 . UlK valid 8tatU3 „ 1 87 3 , 
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indicate whether the cumnt cache-line is valid am i. ■■ 
invalid. •TOBWM. All cache-hnes are initially 



The data cache centred 24 „ „ 8ervjC8 ^ reqMsts 

B 247 (Ks 2> and operend ^ C 248 ^ , ^ > 
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*e ta g ^ 1872 entriM ferth9 laKaddrosM6 of J 
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; e ^~;-^--»-4 8t a.,ha^ t h th e requestedaata lm . 

fetched ftont extereal ^ ^ an inpat bu8 
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Ttnuin, „a w „ Fig . I42 wWch s]Mws memoiy 

I 8 r Tr ^ ^ 2M ' S - * ™ — 

wth ,„ M0hc . lmes C0 ,... iC127 am( a cacie Jine ^ ^ 

cache BAM cooaiate „ f . e „ addreaaabIe £ 



-474- 



164-38 



each having 128 bank-lines of 32 bits with P », h„ v ,• ^. 

,b ' mtb edch cache-line Ci consisting of the 
correspond 8 bank-hnes BOi B7i of the 8 ^ ^ ^ 

The compel of the generated mmplete ^ ' ^ 
kown m Fte 143 . The crated adaMSS „ , ^ J 

^ " *" ^ ^ 1872 ' * «* *- * used fbr 
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T-nngnow to Fig. ,44, whieh ehowe a Meek diagram of the data ^ 
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memory banks of cache memory .30. In this way, eight different bank-hne, 
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of data can be simultaneously Mad a,. eigte rcad data porte (read , b 
reaponaeto reaped B. addres8e8 8upplfe<1 Qn ^ ^ read ^ 
(«.<Uddr) of the memory banks of the each, memoty 230 

Each bank of the cache memory 230 has to own programmable addrea9 
• «~ 1M1 . This allows eight diW locations ,„ be simuJ 

accasse. from the respect eight banks of memory. Each address generator 
1881 has a dcc-tnode i. put fcl settiag [he mode rf rf ^ 

«. 188!, an index-packet input, a base-addreas inpnt an d an addreaa 

«** The modes of operation of the programmable addreaa generator 
10 1881 include 

(a) Kandom access mode where a signal en the dec-mode inpnt acts 
each address mmtK 1881 t „ the random arass ^ 

external memory addresses, are supplied on the index-packe. i„put( e) and 
outputted on the addresa output of one or mora of the a d dreea generator. 
1881; and 

00 JPEG ence d ing and decoding, co.or apace conversion, and matrix 
multiplication modes, whe re . signa j oa the dcc . mode ^ ^ ^ 
generator l881 to tho appiDptia(c ^ fc ^ ^ ^ ^ 

generator 1881 receivea an index on the index-packet input and generates an 
mdexaddree, The index addresses are then added to a fiffid base address 
supplied on the baae-add™ input resulting in a complete external mantcry 
addresa which is then outputted on the address output. Depending „ pon the 
mode of operation, the addresa generate, are able to generate up to eight 
different complete external memory addresses. 

The eight addreaa generator* 188! ron5ist „ ^ 

combtnationaJlogicctadtaaaohhavingaa their inputs; a Waddreaa, a dec 
mode and aa urdex and each ^ „ „ ^ ^ 

an output. 
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A base-address register 1885 stores the current base address that is 
combined with ,he indo* padto, and a de-mode register 1888 stores ^ 
current operational mode (decmode) of the data cache controller 240 

The tag memory 1872 compri.es one block of 123 by 20 bit, muM- port 
* RAM. This RAM has one write port (updato.line.addr), one write enable pot, 

(write), eight read ports (readOline-addr. read71ine-addr) and eight read 

eutput ports (tagO.data ta^data). This enables e ie ht simultaneous 

lookups on the port, <read01ine.addr,..«ad71i M .addr) by the eight address 
orators 1881 to determine, for each line address of the one or more 
• crated memory addresses, the tag addresses currently stored for those 
hnea. The current tag addressee for those lines are outputted on ,hs ports 
(tag0.data....,ag7.data) to the tag comparator 1886. When required a tag 
write s Ig nal is generetod by the cone controller block 187S for supply to the 
wnte port (write) of the tag memory 1872 to enable writing to the tog memory 
15 1872 on the port (update-line-addr). 

A 128-bit line valid memory 1873 contains the line valid status for each 
cache-hne of the cache memory 230. This is 128 by I bit memory with one 
wnte port (update-line-addr), one write enable port (update), eight read ports 
(read01ille - addr and eight read output ports 
(Iinevalid0,..ainevalid7). M a similar manner to the tag memory, this allows 
eight simultaneous lookups on the ports («ad01xne-addr,...,read7hne-addr) by 
the eight address generators 1881 to determine, for each line address of the 
one or more generated memory addresses, the line valid status bits currently 
stored for those lines. The current line valid bits for those lines are 
outputted on the ports (Iinevalid0,...,linevalid7) to the tag comparator 1886 
When required, a write signal is generated by the cache controller block 1878 
for supply to the write port (update) of the line valid status memory 1873 to 
enable writing to the line valid status memory 1873 on the port (update-line- 
addr). 
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15 



The tag comparer w 1886 ron8ista rf ^ 
comparator* ha ™ s .ag.dau inputs fe rcepectiTOly 

me addressee of the currently „ d „ mplete ^ a 
t a£ .ad,. CIW „ tefolrespectivelyreceiviiigthetasa[ldressijsofthe ^ 

POUM complete externa] memory addreases , . dcc _ taplit fc r re 
current operations, m<)d8 ^ ^ ^ fc ^ ^ ^ ^ ^ 
addresses to be Mmpared , ^ . IbM vn] . d , nput fc r ^ ^ ^ 

status bite e^rently 8t0md „ ^ „ ^ Btalufl mcmMy ]873 ^ 
■ lines .ccesaed by the ,i„ 8 addre88es of ^ 

vernal memory addresses. The ^mparator block 1886 ha, eight hit 
outputs ft, each of the eight address generators 1881 . A Wt 8ign „ „ 
asserted when the tag add™ of the generated complete externa, memory 
add... nratchee the content* of the tagn.en.ory 1 872 at ths 
by the fine a ddres 8 of the generated compete externa, memory eddress, atK. 
the Une valid status hi, 1878 for that line i. averted. En this particm „ 
embodiment, the data struck, at0 red ;„ ext8raa , me ■ ^ are ^ ^ 
hence the moat eignificant bite of the tagaddreaeea are the S ame Thna ft b 
preferable to compare only those leae, signifies hita of the tag addresses 
wind. m ay yary. This is acUeved by ,„«, ^ ^ ^ 

(dccmode) sotting the tag eompnrator 1 886 for comparing tiose teast 

significant bits of the tag addresses which may vary. 

The cache controller 1878 accept, a request (pmc_re^ 1876 fern the 
operand B 247 or operand C 248 and acknowledges fercc ack) 1879 thia 
request if the data i, .vaUable in cache memory 230. Depending on the 
mode of operation, up to eigh, differently addressed data items may be 
rented, one from each of the sight banks of cache memory 23P Tto 
requested data is a ™ Uabls in cache ^ ^ ^ ^ 

1886 assorts a hi, for that line of memory. The cache controller 1878 in 
response to the asserted hit signs, a ^ ^ 
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signal on the port (cache . read> „ enabling ^ rf ^ ^ ^ ^ 
wh,ch ft. hi, signs, been asseifed ^ n a Kquest ^ ig76 
aeserfed, but not tho nit eign.1 0u t 0,...,hi, 7) , . 89Mrated ^ 

i89c » sent to a. „ memoiT ^ w . th ^ ^ 

memory address fa tha t rf data ^ . s wriura ^ 

tha eight banks of cache memory 230 ™ tbe inpu, <exUiata> when i, » 
arable fa. the eMeraa , memMy ^ ^ ^ ^ 

Nation is also written into the te 8m<lmoly 18g6 M ^ ^ 
and tho hno status bit 1873 for that line asserted. 
- Data tarn ft. eighl baaks rf ^ ^ fc ^ 

through a series of multipara in a data or.ani.er 1 892 . so tte data „ 
posmoned ina predetermined manneriuanoutput data packet. m S 4 In one 
operational mode, the date organizer 1892 ia ah!e t o select and output eight 
8-b.t words from the respective eight 32-bit words outputted from the eight 
» memory banks by ^ mag fc ^ ^ ^ ^ 

and the byte addressee <byte.addr, of the current generated complete ertemal 
memory addressee. In another operational mode, the data organic 1B92 
direct outputs the eight 32-bit word, outputted from the eight memory 
ban**. Ae noted previous*, th , data 0 ^ nfaer ^ ^ # 

predetermined manner for output. 

A request would comprize the following steps: 

1) The processing unit requests a packet of data by supplying an 
address to the processing unit interface ofthe cache controller 1878; 

2) Each of the eight address generator units 1881 then generate a 
separate add^s f or each block of cache memory depending on the mode of 
operation; 

3) The Tag portion of each of the generated addresses is then 
compared to the Tag addrese etored in the four blocks of triple.port Tog 
memory 1S86 and addressed by each ofthe corresponding fc. part „, the 
eight generated addresses; 
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4) f'^'»»td, lu ,d,he 1 i mwIi(letMu8l87 3 fot , hat 
-erted, the data Rested tor blodI of memow js deemEd ^ 
resident in the said cache memory 230; 

5) Da,. that „ mt ^ i8 fetched ^ extMMi ^ 

aU e*h, U o*s of, he cache m e nory 230 are updated with ^ ^ rf 4>ta 

^ ~ »• *« **~ - *• - ^a is a. ^ to 
lor that hue asserted; 

8, When all reqiM8ted data i fcmsaK!rcaldentll , coclK 230 it 

- '"P" M ° te ^^P-»»ia e ;.» i t in „ predetermill . dp>oketf()nnal ' 
A, pre v,„„» ly noted , .„ t]K modt]M (Rg 2) of 

tneh.de a standard cB„e interface 308 (Fi s 20). For m detaiIs „ fc 
standard cBue ft**. registem for the data ^ ^ ^ ^ ^ 
230, reference is raa de to page, B42 to B46 of Appendix B. The setting in 

« th " e ^'^oo« ro aheo P ora t io„of t heda te controll e r240. For the sake 
of sunplicit* „nl y two of the8e ^ ^ ^ ^ 

base_address and dcejnode. 

Once the data cache controller 240 and data cache 230 are enabled the 
data cache oo„tro..er i„ tially opeMKs intne ^ ^ ^ ^ ^ ^ 

mvahd. At the end of an instrurtion, the data cache contralto, 240 and oache 
230 always averts to the n„ mal mode of operation. In all of the foUowine 
modes except the Tta-Kd*, mode, there i. aB "Anto-fill and ^ 
option. By settinga bit in the doc^ regis,*, it is possible t0 m ^ entto 
cache starting a. the addrees stored in the base.address register. During 
U- operate the data quests ton, ft. opBrand 0rBani2eK , ^ 
247^48 «. ^ ollt ^ a, opcration is ^ cactc 

validated at the end of this operation, 
a. Normal Cache Mode 

to «hi S mode, the tw, operand orders snpply , he extemal 
men.o.y addresses of the data rented. The address generator ,881 
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oufcute the compbte ertema. mcmory addK8M8 wb . ch aK 

'"^ "« — » « - that „ the data 
nested rs res iden, it the memory rache 230 . „ „„,„ ^ 

«« not in cache 230, da, a will he r^atod fem the input ^ ^ 

« ^02. Round Robin scheduling will 

earning will be implemented to aervice percent 

simultaneous requests. 

Forsimultoneous if eoe of the data items „ resident ta Mche 

*■ «■ be placed e„ the least significant 32 Mte of cach ^ ' 

lit. o*or data w.,1 be ro,uo S tod the ^ 

• b. Tie Single Output Genera! Cel., Space Convert Made 

ta tk. mode, the req uaat comes font operand o^anfe B in a, fom of 
a I2.hU b*e address. The re9 „ e8l ed data ta ate 8-M« roIor output 
values as previous* diseussed with reference to K s 60. The 12-bit addmes 
« 'fed to the indent inputs of the address generator* 188! an d the eight 
address generators 1881 genemte eight diflw 82-bu OTInplete 

memoryaddtessesofthefonnat showuinngore 96. Thebank, line and byte 
addresses of the ^ne rated romplrte ^ fa 

mft Table 12 and Pig. 61. Th* external memo* address is interpreted as 
etght 9-bit line and byte addressee. which am need to addrass a byto tan 
each of the eight banks of RAM. The cache in .cased to „ btaill the oight 
byte values ftom each bank whi* are cettntred to the operand organisers for 
sub s . q uent interpolation by the main data path 242 in accordance with the 
pr.no.ples previously discussed with reference to Fig. 60. As the singfc 
output color value , ab le is able ,o fit entirely within the cache memory 280 it 
■a preferable to load the entire surgle output color value table wiehin the cache 
memory 230 prior to enabling the single color conversion mode 
c. Multiple Output (Wra! Color Spue. Conversion Mods 

m this mode, a 12-bit word address is received from operand o^ani 2er B 
247. The guested data items are 32-bit color output values as proving 
drscnaeed with reference to Fig. 02. The 12-bit addreea is fed to the 
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mde^acke, tapnta tbe addre8s „ re ^ ^ ^ 
gators 1861 „ eight ditelU S2 . Ul extemai 

addresses of the fonnat shown In fi^ M . Tha lfaa and te6ilddre£se8 rf ^ 
comply external menoly addr96sss aro . n ^ ^ 

•*> 9-b« ^ a. 9 . lit oddre8s bein? decompo8ed ^ a 

Une add™ Md . 2 . bit tagaddreM aa Hisa]5sed previousiy with Kfe ^ ^ 

B* 63. Upon the tagaddre*, not W „ fcmd , the ^ ^ ^ 
appropriate d ata la loaded fan ,ha inpn, interface switch 252 CR g 2) Upon 
-o the data ba tag aeai.ab.e, the ontpnt data «. retlrasd „ the — 
organizers. 

d. JPEG Encoding Mode 

m thia mode, the necessary tables for JPEG encoding and other 
operational sub-sets are stored in each bank of cache RAM. The storage of 
■■ tables being previously described in the previous discussion of the JPEG 
encoding mode (Tables 14 and 16). 

e. Slow JPEG Decoding Mode 

In this mode, the data is organized in accordance with Table 17. 

f. Matrix Multiplication Mode 

In thia mode, the cache is utilized to access 256 byte lines of data 

g. Disabled Mode 

In this mode, all reacts are p as8ed through to the input interface 
switch 252. 

h. Invalidate Mode 

In this mode, the contents of the entire cache are invalidated by clearing 
all the line valid status bits. 
3 . 1 8 . 7 Input Interface Switch 

Retnrninsagata «o Pig. 2, the inpn, interface switch 252 performs the 
Amotion „, arbitrating data req ne S , s fan the pixel organiser 246, the data 
cache contra, 2 40 the inetruotioQ ^ ^ ^ 
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interface switch 262 tart, addressee and data a, required to the eternal 
interface controller 238 and local memory controller 236. 

The input interface switch 252 stores in one of Us configuration 
renter the base address or the memory object in the host memorv map 
> Tins . a virtual address that must be aligned on a page boundary, hence 20 
address bits are required. For each request made by the pisel organic 
data cache controller, instruction contreller, the input interface switch 252 
first subtracts the co-processor's base address bite from the most significant 6 
bits of the start address of the data. If the result is negative, or the most 
. mgnificant 6 bite of the result .re non-zero, this indicates that the desired 
destination is the PCI bus. 

If the most significant 6 bits of the result are zero, this indicates that 
the data maps to a co-processor's memory location. The input interface 
switch 252 then need* to check the next 3 bits to determine if the co- 
processor's location is legal or not. 

The legal co-processor's locations that may act as a source of data are: 

1) 16 Mbytes occupied by the Generic interface, beginning at an offset 
of 0x01000000 from the co-processor's base address. 

2) 32 Mbytes occupied by the local memory controller (LMC), 
starting at an offset of 0 X 02000000 from the base address of the co- 
processor's memory object. 

Requests that map to an illegal co-processor's location are flagged as 
errors by the Input Interface Switch. 

The PCI bus is the source of data corresponding to any addresses that 
map outside of the range occupied by the co-processor's memory object. An i- 
eource signal is used by the input interface switch to indicate to the EIC 
whether requested data is to originate from the PCI bus or the Generic 
interface. 
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After the address decoding process M 
~a Bna in terf . a wlKn J Jl £ ~ » ~- * the 

a data faction * the inpu , ' ^ ™" EIC " MC - 
-* ™. However : Z t 7 WlEn haVe 

- «*r to define when the en™, data , 

data transaction has completed 
I»e input tnterface switch 2so „ . .. P'eiea. 

be instantly met since there aw A . , * * 0t aU re( « uea ^ 

— — need by ^TJl^— * 
Programmable. Control hit, ■„.• Pnority-b a8ed and 

Umtrol bits within a configuration regi.t^r nf a ■ 
interface switch BDerift, f u Q i . agister of the input 

w s P ecir j r the relative priorities r,f • 

ntoMtfaxac, to the 8an!e re60 . *" """-^ •» 

deduce the new winners. ^ 8 ° heme to 



As immediate access to a resou™ ™ 
toterface switch needs to a t 1 * ° P ° 88iWe ' the ^ 

data cache co DtroIler and 
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■nation confer. Tie addres. 400*0,863 check, the address j. a letal 
one an. Prt » anya„d res8 re . nappi ^ ^ ^ ^ ^ 

wh.cn re ,„e 8t to pa, s fa. one Baa transceiver 661 * . seoonJ ffius 
transceiver^. Preferrably, the priority systcm „ proBIammabIe 
« The Bus transceivers 86!, 662 contain ,11 «. neeemry ntnltipfc*,,., 

demtdtiplestng and tnatat* bnfferh* to enable caseation ova, the 
vanous interfacea to the input interfece switch. 
3. 18.8 Local Memory Controller 

Returning again to Fig. 2, the local memory controller 236 is 

quests between the local .emery and modules within the oo-p_ 
The local memory controller 236 responds to write revests from the result 
organizer 249 and read quests from the input interface switch 252 
Add^onally, it also responds to both read and write requests from the 
- peripheral interface controller 237 and the usual global CBus input The 
local memory controller util.es a programmable priority system and further 
utilizes FIFO buffers to maximize throughput. 

In the present, invention, a mu l ti . port burst dymmic 
controller is utilized in addition to using First-In-First-Out (FIFO) buffers to 
de-couple the ports from a memory array. 

Fig. 146 depicts a block diagram of a fbur-port burst dynamic memory 
controller according to a first embodiment of the present invention. The 
circuu includes two write ports (A 1944 and B 1946) and two read ports (C 
1948 and D 1950) that require access to a memory array 1910. The data 
paths from the two write ports p a8s through separate FIFOs 1920, 1922 and 
to the memory array 1910 via a multiplexer 1912. while the data paths of the 
read ports 1948, 1950 pass from the memory array 1910 via separate FIFOs 
1936, 1938. A central controller 1932 coordinates all port accesses as well 
as driving all the control signals necessary to interface to the dynamic 
memory 1910. A refresh oounter 1934 determines when dynamic memory 
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with the controller 1932. 

1.1. a,?' 6 '?' ^ 18 "*« to «■» — «" «v 

■ "J. -o, 1922 . fem to m 1936 , 1938 to J Kad - 

transfers „ or fa. the nemMy amy ^ ^ fe fc ^ 

and read ports 1944, 1946, 1948, 1950. 

10 Data is written into the memory arrav 191f) ™ a ■* u 

v y iyiu wa either one of the write 

Pons 19«, me. We ^ _„ to ^ ^ ^ 
-(r a FIFO 1920 , ,„ which „ initially empty Data tran6fem ^ J 
the wrrte ports 1944, ,946 proceed unintpeded unffl 0. FIFO l92 0 1922 „ 
filled, cr the hart is ended. When data U tot written int „ the ^ J920 
- 1922, the control 1932 arbitrates with the exports „ tbe DRAM ^ 
When accese is granted, date i. read out of the FIFO 1920, , 922 it the ^ 
raj. end wntten into the ntentory array 1910. A burst write cjda to DRAM 

"1* W Whena PreBe ' —»««"«■ ™* have heen .torod in 
the BTFO 1920, ,922. or wt,n the burst fnJm thfi ^ ^ ^ fc 

case, the hurst to DRAM 1910 proeneds when granted and continue unti, the 
mo 19*0, 1 922 „ enrptied, or there is a c^e ^ ta . ^ priority 
port. In either event, data continues to be written into the F[FO 1920 1922 
from the write port without hindrance, until the FIFO is M led , ot mH1 the 
burst ende and a new burst i, started. h the Uaer ^ ^ ^ ^ 
proceed until the preview burBt ^ been emptied ^ ^ ^ ^ 

and written to the DRAM 19,0. b the former ease, data transfers 
reconvenes* as soon as the first word is rcad M „p the FIFO 1 9 20 1 92 2 and 
written to DRAM D»a to the highorrato o, data transfer..,^ 

™« 1920, 1922 , It is on) , possibfe _ t|M writo pm ]94e ' 

the contreUer ,932 is interrupted with ,* revests ta the Mhpp 
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Any interruption to the data transfers fiom the write ports 1944, 1946 to the 
FIFOs 1920, 1922 is preferably kept to a minimum. 

The read ports 1948, 1950 operate in a converse fashion. When a read 
port 1948, 1950 initiates a read revest, a DRAM cycle is unmediately 
> -q-eted. When granted, the memory array 1910 is read and data j- 
wntten into the corresponding FIFO 1936, 1938. As soon as the first data 
word is written into the FIFO 1936, 1938, it is available for read-out by the 
read port 1948, 1950. Thus there is an initial delay in obtaining the first 
datum word but after that there is a high likelihood that there are no further 
. delays in retrievxng the successive data words. DRAM reads will be 
terminated when a higher priority DRAM request is received, or if the read 
FIFO 1936, 1938 becomes full, or when the read port 1948, 1950 requires no 
mom data. Once the read has been terminated in this way, it is not 
restarted until there is room in the FIFO 1936, 1938 for a preset number of 
data words. Once the read port terminates the cycle, any data remaining in 
the FIFO 1936, 1938 ie discarded. 

In order to keep DRAM control overheads to a mmhnum, ^arbitration 
for the DRAM access is restricted so that burets cannot be interrupted until a 
preset number of data words have been transferred (or until the corresponding 
write FIFO 1920, 1922 is emptied, or read FIFO 1936, 1938 is filled). 

Each of the access ports 1944, 1946, 1948, 1950 has an associated 
burst start address which is latched in a counter 1942 at the start of the burst 
Tins counter holds the current add TCS6 for transactions on that port so that 
should the transfer be interrupted, it can be resumed at any time at the 
correct memory address. Only the address for the currently active DRAM ^cle 
is selected by multiplexer 1940 and passed on to the row address counter 
1916 and column address counter 1918. The low order NT bits of address are 
inputted to the column counter 1918 while the higher order address bits are 
inputted to the row counter 1916. Multiplexer 1914 outputs row addresses 
from the row counter 1916 to the memory array 1910 during the row address 
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tune of the DRAM and passes addresses fom the ro]umi) ^ 

1918 during column address tta. of the DRAM. The row address ^ 
1916 and the column address MU nte, 1918 are |oaded ^ ^ rf 
burst to the memory array DRAM 1910. This is ,™ both at the star, of a 
. po« cycle and at the continuation of an interrupted burs,. The column address 
count* .1918 is indented after each transfer to memory has taken pla ce 
whale the row address counter 1916 is tucremented when the column adtte-s 
counter 1918 rolls over to a count of zero. When the fetter happens, the bur,, 
must be terminated and restarted at tie new row address. 

In the preferred embodiment i, i, assumed that memory array 1910 
comprises 4 * 8 bit byte Unas making up a 32 bits per wort. Further there is 
associated with each write port 1944, 1946 a set of four byte write enable 
s.gnals 1950, 1952 which individual allow data to be written to each 8-bit 
portion of each 32-bit data word in the memory array 1910. Since it is 
possible to arbitrarily mask the writing of data to any byte within each word 
that ,s written to the memory array mo, it is necessary to store the write 
enable rnfomtation alongwith each data word in corresponding FIFOs 19 26 
1928. These FIFOs 1926, 1928 are controlled by the same signals the, 
control the write FIFOs 1920. 1922 but am only 4 bits wide instead of the 92 
hit. required for the write data in FIFOs 1920. ,922. h like fashion 
multiple** 1930 is controlled in the same manner as the multiplexer 1912' 
The selected byte write enables are inputted to the controller 1932 which uses 
the tnformation to selectively suable or disable writing to the addressed word 
m the memory army 1910 in synchronisation with the write data being 
inputted to the memory array 1910 by way of multiplexer 1912. 

The arrangement of Fig. 146 operates under the control of the 
controller 1932. Fig 147 la a state machine diagram depleting the derail of 
operation of eh, eontroUor 1932 ef Fig 146. After powar up and at tho 
completion of reset the state machine is forced into state IDLE 100 in which 
all DRAM control signals a* driven inactive (high) and multiplexer 1914 
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drives row addresses to the DRAM array 1910. When a refresh or cycle 
request is detected, the transition is made to state RASDEL1 1963. On the 
next clock edge the transition to state RASDEL2 1964 is made. On the next 
clockedge, if the cycle request and refreshhave gone away, the state machine 
- returns to state IDLE 1900. otherwize. when the DRAM tRP (RAS precharge 
timing constraint) period has been satisfied, the transition to state RASON 
1966 is made at which time the row address strobe signal, RAS, is assorted 
low. After tRCD (RAS to CAS delay timing constraint) has been satisfied 
the transition to state COL 1968 is made, in which the multiplexer 1914 is 
• switched over to select column addresses for inputting to the DRAM arrav 
1910. On the next clock edge the transition to state CASON 1970 is made 
and the DRAM column address strobe (CAS) signal is driven active low 
Once the tCAS (CAS active timing constraint) has been satisfied, the 
transition to state CASOFF 1972 is made in which the DRAM column 
address strobe (CAS) is driven inactive high once again. At this point, if 
furtherdato words are to he transferred and a higher priority cycle request or 
refresh is not pending or if it is too soon to rearbitrate anyway, and once the 
tCP (CAS precharge timing constraint) has been satisfied, the transition back 
to state CASON 1970 will be made in which the DRAM column address 
strobe (CAS) is driven active low again. If no further data words are to be 
transferred, or if rearbitrating is taking place and a higher prionty cycle 
request or refresh is pending, then the transition is made to state RAS OFF 
1974 instead, providing tRAS (RAS active timing constraint) and tCP (CAS 
precharge timing constraint) are both satisfied. In this state the DRAM row 
address strobe (RAS) signal is driven inactive high. On the next clock edge 
the state machine returns to state IDLE I860 ready to start the next cycle 

When in state RASDEL2 1964 and n refresh request is detected, the 
transition will be made to state RCASON 1980 once tRP (RAS precharge 
timing constraint) has been satisfied. In this state DRAM column address 
slrobe is driven active low to start a DRAM CAS before RAS refresh cycle 
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On he « ^ ed8B the traflaition to ste[e rrason i9?8 . ^ 

addlB5S Stebe <RAS > - active l„ w . When tCAS (CIAS 

197. -U b. made ta ^ DRAM ^ ad!bes6 B(robe 
~ e h. ^WM,,.^^^ " 
-~ — EASCr 187d „ made m wUdl DKAM _ aiiKm etrote 
CBA8). invenmacive bi^e fl 3c l i 5 o ly on d in. t ae n. state 

*" C ° nti1 "™ " ""° ^ " ■"" m " 1 TOAM the 
transition back to state IDLE 1960. 

^ sigrah « . w rate of OM per 15 miTOseOTidSi ^ 

»* - determined by tlM particidar DRAM mtmtm requjremen(6 
Wb. a refresh .quest is aaaerad , it Kmains ^ ^ 

^.cWiea^entis^^ 

™° RCAS0N »«0 - "mains averted un,i. the 
scats machine detects the refresh request has been ue-asserted. 

In Fig. 148, there is set out in pseuoo code form, the operation of the 
abator 192d „ tK , 14e . , ^ ^ rf ^ 

four o^e request i3 ^ sooess to ,he memo,, array 1910 , and ^ . 
mechamsm for modish, cole req ue„er priorities in order to maintain a 
fiuraceessragime. * aymbois uscd h this ^ ^ ^ ^ ^ 

Each requestor has 4 bits associa.ed „i,h i, lhat repMsaIlt ^ 
requesters p* rity . «. two hiehorder bits am prcset „ „ 
fry wa y of Nation values set in a senora, conflation regi8ter ne 

^iowor d erM, 8 ofprion lyare heldina 2 .bi« counter that is updntedb, the 
aerator ,4. When determining ,he victor in an arbitration, the arbitlor 

1 J24 simply oomparea the 4-bit values of each of th a . 

oi each oi the requesters and grants 
access .o the ^ the ^ ^ ^ ^ ^ 

.^e^Worder^bitprio^ccunivaJueisdearedto^,^^^ 
requesters with identical high order 2*it pn„n, y vaiues andwhose order 
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2-b„ pnority i. le 88 than the ^ ,„„ order ^ ^ ^ ^ 

order 2-bi, priority counts indented byotw. Tbis has th . effea „ f making 
a re,ue S te, that has just bc6B ^ ^ ^ ^ ^ ^ ^ 

■owes, Priority ^ «sque 8 ter 8 with the same priority high order 2-bi. value 
■ The priority W order 2-bi, value of other re ,ueste K with priority high order 
2-bit value different to that of the winning Kq ^ t ^ ^ afeted ^ 
high order two lite of priority determine the overall priority of a requester 
wble the low order two bite inati. a fair arbitration scheme among tequeatet, 
with identical high order priority. This scheme allows a number of 
« arbitration schemes to be implemented ranging*™ hard-wired tod prioritv 
(high order two bits of each tequester unique), though part bating and part 
hatd-wed (some high order 2-bit prioritise different to others, but not all) to 
strictly fair and rotating (all priority high order 2-bit fiehb the same). 

Fig. H9 depicts the structure of the priority bits associated with each 
.« ^neater and how the bite are utilized. ,t a ls „ defines the symbofe uaed in Fig 
148. 5 * 

In the preferred embodiment, the various FIP0 6 1920. 1922, 1938 and 
1936 are 32 bits wide and 32 words deep. This particular depth prides a 
good compromise between efficiency and circuit area consumed. However the 
depth may be altered, with a corresponding change in performance, to suit the 
noods of any particular application. 

Also, the four port arrangement shown is merely a preferred 
embedment. Even the provision of a singb TWO buffer between the memory 
array and either a read or write port will provide some benefits. However 
the use of multiple read and write ports provides the greatest potential speed 
increase. 

3.18.9 Miscellaneous Module 

The miscellaneous modulo 239 provides dock generation and selection 
for the operation of the co-processor 224, reset synchronization, multiplexing 
of error and interrupt signals by routing of internal diagnostic signals to 
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external pins as required, interfacing between the internal and external form 
of the CBus and multiplexing of internal and generic Bus signals onto a 
generic/external CBus output pins. Of course, the operation of the 
miscellaneous module 239 varies in accordance with clocking requirements 
; and implementation details depending on the ASIC technology utilized. 
3.18.10 External Interface Controller 

The followingdesuribed apsects of the invention relate to a method and 
an apparatus for providing virtual memory in a host computer system having 
a co-prucessor that shares the virtual memory. The embodiments of the 
invention seek to provide a ^-processor able to operate in a virtual memory 
mode in conjunction with the host processor. 

In particular, the co-processor is able to operate in a virtual memory 
mode of the host processor. The co-processor includes a virtual-memory-to- 
physical-memory mapping device that is able to interrogate the host 
processor's virtual memory tables, so as to map instruction addresses 
produced by the co-processor into corresponding physical addresses in the host 
processor's memory. Preferably, the virtual-memory-to-physical-memo^y 
mapping device forms part of a computer graphics co-processor for the 
production of graphical images. The co-processor may include a large number 
of modules able to form various complex operations on images. The mapping 
device is responsible for the interaction between the co-processor and the host 
processor. 

The external interface controller (EIC) 238 provides the co-processors 
interface to the PCI Bus and to a generic Bus. It also provides memory 
management to translate between the co-processor's internal virtual address 
space and the host system physical address space. The external interface 
controller 238 acts as a master on the PCI Bus when reading the data from 
the host memory in response to a request from the input interface switch 252 
and when writing data to host memory in response to a request from the 
result organizer 249. The PCI Bus access is implemented in accordance the 
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well known standard with "PCI Local Bus specification, draft 2.1", PCI special 
interest group, 1994. 

The external interface controller 238 arbitrates between simultaneous 
requests for PCI transactions from the input interface switch 252 and the 
result organizer 249. The arbitration is preferably configurable. The types 
of requests received include transactions for reading less than one cache line 
of the host co-processor at a time, reading between one and two cache lines of 
the host and reading two or more cache lines of the host. Unlimited length 
write transactions are also implemented by the external interface controller 
238. Further, the external interface controller 238 optionally also performs 
prefetching of data. 

The construction of the external interface controller 238 includes a 
memory management unit which provides virtual to physical address 
mapping of host memory accesses for all of the co-processor's internal modules. 
This mapping is completely transparent to the module requesting the access. 
When the external interface controller 238 receives a request for host memory 
access, it initiates a memory management unit operation to translate the 
requested address. Where the memory management unit is unable to 
translate the address, in some cases this results in one or more PCI Bus 
transaction to complete the address translation. This means that the 
memory management unit itself can be another source of transaction requests 
on the PCI Bus. If a requested burst from the input interlace switch 252 or 
results organizer 249 crosses the boundary of a virtual page, the external 
interface controller 238 automatically generates a memory management unit 
operation to correctly map all virtual addresses. 

The memory management unit (MMU) (915 of Fig. 150) is based 
around a 16 entry translation look aside buffer (TLB). The TLB acts as a 
cache of virtual to physical address mappings. The following operations are 
possible on the TLB: 
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1) Compare: A virtual address is presented, and the TLB returns 
either the corresponding physical address, or a TLB miss signal (if no valid 
entry matches the address). 

2) Replace: A new virtual-to-physical mapping ia written into the 
TLB. replacing an existing entry or an invalid entry. 

3) Invalidate: A virtual address is presented; if it matches a TLB 
entry, that entry is marked invalid. 

4) Invalidate All. All TLB entries are marked invalid. 

5) Read: A TLB entry's virtual or physical address is read, based on 
a four bit address. Used for testing only. 

6) Write: A TLB entry's virtual and physical address is written, 
based on a four bit address. 

Entries within the TLB have the format shown in Fig. 151. Each 
valid entry consists of a 20-bit virtual address 670, a 20-bit physical address 
G71. and a flag which indicates whether the corresponding physical page is 
writable. The entries allow for page sizes as small as 4kB. A register in 
the MMU can be used to mask off up to 10 bits of the addresses used in the 
comparison. This allows the TLB to support pages up to 4MB. As there is 
only one mask register, all TLB entries refer to pages of the same size. 

The TLB uses a "least-recently-used" (LRU) replacement algorithm. 
A new entry is written over the entry which has the longest elapsed time since 
it was last written or matched in a comparison operation. This applies only 
if there are .no invalid entries; if these exist, they are wntten to before any 
valid entries are overwritten. 

Fig. 152 shows the flow of a successful TLB oo mpare operation. The 
incomingvirtual address 880 is divided into 3 parts 881 - 883. ' The lower 12 
bits 881 are always part of the offeet inside a page and so are passed directly 
on to the corresponding physical address bits 885. The next 10 bits 882 are 
either part of the offeet, or part of the page number, depending on the page 
size, as set by the mask bits. A zero in the mask register 887 indicates that 
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the bit is part of the page offset, and should not be used for TLB comparisons. 
The 10 address bits are logically "ANDED" with the 10 mask bits to give the 
lower 10 bits of the virtual page number 889 for TLB lookups. The upper 10 
bits 883 of the virtual address are used directly as the upper 10 bits of the 
virtual page number 889. 

The 20-bit virtual page number thus generated is driven into the TLB. 
If it matches one of the entries, the TLB returns the corresponding physical 
page number 872, and the number of the matched location. The physical 
address 873 is generated from the physical page number using the mask 
register 887 again. The top 10 bits of physical page number 872 are used 
directly as the top 10 bits of the physical address 873. The next 10 bits of 
physical address 872 are chosen 875 from either the physical page number (if 
the corresponding mask bit is 1), or the virtual address (if the mask bit is 0). 
The lower 12 bits 885 of physical address come directly from the virtual 
address. 

Finally, following a match, the LRU buffer 876 is updated to reflect 
the use of the matched address. 

A TLB miss occurs when the input interface switch 252 or the results 
organizer 249 requests an access to a virtual address which is not in the TLB 
872. In this case, the MMU must fetch the required virtual-to-physical 
translation from the page tabic in host memory 203 and write it into the TLB 
before proceeding with the requested access. 

The page table is a hash table in the hosts main memory. Each page 
table entry consists of two 32-bit words, with the format shown in Fig. 153. 
The second word comprizes the upper 20 bits for the physical address and the 
lower 12 bits are reserved. The upper 20 bits of the corresponding virtual 
address are provided in the first word. The lower 12 bits include a valid (V) 
bit and writable (W) or a "read-only" bit, with the remaining 10 bits being 
reserved. 
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The page table entry contains essentially the same information as the 
TLB entry. Further flags in the page table are reserved. The page table 
itself may be, and typically is, distributed over multiple pages in main 
memory 203, which in general are contiguous in virtual space but not physical 
space. 

The MMU contains a set of 16 page table pointers, setup by software, 
each of which is a 20-bit pointer to a 4kB memory region containing part of the 
page table. This means the co-processor 224 supports a page table 64kB in 
size, which holds 8k page mappings. For systems with a 4kB page size, this 
means a maximum of 32MB of mapped virtual address space. Preferably, 
the page table pointers always reference a 4kB memory region, regardless of 
the page size used in the TLB, 

The operation of the MMU following a TLB miss is shown 600 in Fig. 
154, as follows: 

1 . Execute the hash function 892 on the virtual p age number 891 
that missed in the TLB, to produce a 13-bit index into the page table. 

2. Use the top 4 bits 894 of the page table index 894, 896 to 
select a page table pointer 895. 

3. Generate the physical address 890 of the required page table 
entry, by concatenating the 20-bit page table pointer 895 with the lower 9 bits 
of the page table index 896, setting the bottom 3 bits to 000 (since page table 
entries occupy 8 bytes in host memory). 

4. Read 8 bytes from host memory, starting at the page table 
entry physical address 898. 

5. When the 8-byte page table entry 900 is returned over the PCI 
bus, the virtual page number is compared to the original virtual page number 
that caused the TLB miss, provided that the VALID bit is set to 1 . If it does 
not match, the next page table entry is fetched (incrementing the physical 
address by 8 bytes) using the process described above. This continues until a 
page table entry with a matching virtual page number is found, or an invalid 
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page table entry is found. If an invalid page table entry is found, a page fault 
error ia signalled and processing stops. 

6. When a page table entry with a matching virtual page number 
is found, the complete entry is written into the TLB using the replace 
operation. The new entry is placed in the TLB location pointed to by the 
LRU buffer 876. 

The TLB compare operation is then retried, and will succeed, and the 
originally requested host memory access can proceed. The LRU buffer 876 is 
updated when the new entry is written into the TLB. 

The hash function 892 implemented in the EIC 238 uses the following 
equation on the 20 bits of virtual page number (vpn): 

index = ((vpn»Sj) XOR (vpn»S 2 )XOR(vpn»S 3 )) & Oxlfff; 
where sj, s 2 and S 3 are independently programmable shift amounts 
(positive or negative), each of which can take on four values. 

If the linear search through the page table crosses a 4kB boundary, the 
MMU automatically selects the next page table pointer to continue the search 
at the correct physical memory location. This includes wrapping around from 
the end of the page table to the start. The page table always contains at 
least one invalid (null) entry, so that the search always terminates. 

Whenever the software replaces a page in host memory, it must add a 
page table entry for the new virtual page, and remove the entry corresponding 
to the page that has been replaced. It must also make sure that the old page 
table entry is not cached in the TLB on the co-precessor 224. This is achieved 
by performing a TLB invalidation cycle in the MMU. 

An invalidation cycle is performed via a register write to the MMU, 
specifying the virtual page number to be invalidated, along with a bit that 
causes the invalidation operation to be done. This register write may be 
performed directly by the software, or via an instruction interpreted by the 
Instruction Decoder. An invalidation operation is performed on the TLB for 
the supplied virtual page number. If it matches a TLB entry, that entry is 
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marked invalid, and the LRU table updated so that the invalidated location 
is used for the next replace operation. 

A pending invalidate operation has priority over any pending TLB 
compares. When the invalidate operation has completed, the MMU clears 
the invalidate bit, to signal that it can process another invalidation. 

If the MMU fails to find a valid page table entry for a requested 
virtual address, this is termed a page feult. The MMU signals on error, and 
stores the virtual address that caused the fault in a software accessible 
register. The MM U goes to an idle state and waits until this error is cleared. 
When the interrupt is cleared, the MMU resumes from the next requested 
transaction. 

A page fault is also signalled if a write operation is attempted to a 
page that is (not marked writable) marked read only. 

The external interface controller (EIC) 238 can service transaction 
requests from the input interface switch 252 and the result organizer 249 that 
are addressed to the Generic bus. Each of the requesting modules indicates 
whether the current request is for the Generic Bus or the PCI bus. Apart 
from using common buses to communicate with the input interfeoe switch 252 
and the results organizer 249, the EIC's operation for Generic bus requests is 
entirely separate from its operation for PCI requests. The EIC 238 can also 
service CBus transaction types that address the Generic bus space directly. 

Fig. 150 shows the structure of the external interface controller 238. 
The IBus- requests pass through a multiplexer 9 10, which directs the requests 
to the appropriate internal module, based on the destination of the request 
(PCI or Generic Bus). Requests to the Generic bus pass on to the generic bus 
controller 911, which also has RBus and CBus interfeces. Generic bus and 
PCI bus requests on the RBus use different control signals, so no multiplexer 
is required on this bus. 

IBus requests directed to the PCI bus are handled by an Bus Driver 
(IBD) 912. Similarly, an RBus Receiver (RBR) 914 handles the RBus 
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requests to PCI Each of the BD 912 and RBR 914 drive virtual addles 
to the memory management unit (MMU) 915, which provide* physical 
addles in return. The IBD, RBR and MMU can each request PCI 
transaction, which are generated and controlled by the PCI master mode 
controller (PMC) 917. The BD and the MMU request only PCI read 
transactions, while the RBR requests only PCI write transactions. 

A separate PCI Target Mode Controller (PTC) 918 handles all PCI 
transactions addressed to the eo-processor as a target. This drives CBus 
master mode signals to the instruction controller, allowing it to access all 
other modules. The PTC passes returned CBus data to be driven to the PCI 
bus vxa the PMC, so that control of the PCI data bus pins comes from a single 



source. 



CBus transactions addressed to EIC registers and module memory 
are dealt with by a standard CBus interlace 7. All submodules revive some 
bits from control registers, and return some bits to status registers, which are 
located inside the standard CBus interface. 

Parity generation and checking for PCI bus transactions is handled by 
the parity generate and check (PGC) module 921. which operates under the 
control of the PMC and PTC. Generated parity is driven onto the PCIbus, as 
are parity error signals. The results of parity checking are also sent to the 
configuration registers section of the PTC for error reporting. 

Fig. 155 illustrates the structure of the Bus driver 912 of Fig. 150. 
Incoming Bus address and control signals are latched 930 at the start of a 
cycle. An or-gate 931 detects the start of the cycle and generates a start 
signal to control logic 932. The top address bits of the latch 930, which form 
the virtual page number, are loaded into a counter 935. The virtual p agB 
number is passed to the MMU 915 (Fig. 150) which returns a physical page 
number which is latched 936. 

The physical page number and the lower virtual address bits are 
recombined according to the mask 937 and form the address 938 for PCI 
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requests to the PMC 717 (Fie 102^ TT~ , , 

The burst count for the cycle is also 
loaded into a counter 939 Prufufri, *• 

>vr » oa . i-reietch operations use another counter 941 and 

an address latch and compare circuit. 943. 

Data returned from the PMC is londed into a FIFO 944, along with a 
- marker which indicate., whether the data is part of a prefetc h. As data 
becomes available at the front of the FIFO 944, it is clocked out by the read 
lope m synchronization latches 945,946. The read logic 946 also generates 
the IBus acknowledge signal. 

A central control block 932. include staC e machines, ^ the 

PMC. 

The virtual page number counter 935 is loaded at the start of an IBus 
transaction with the page number bits from the IBus address. The top 10 bit 
of thxs 20-bit counter always come from the incoming address. For the lower 
10 lata, each bit is loaded from the incoming address if the corresponding 
mask bit 937 is set to 1; otherwize, the counter bit is set to 1. The 20-bit 
value is forwarded to the MMU interface. 

In normal operation the virtual page number is not used after the 
nutrnl address translation However, if the H3D detects that the hurst has 
crossed a page boundary, the virtual page counter i s incremented, and another 
translation is performed. Since the low order bits that are not part of the 
virtual page number are set to 1 when the counter is loaded, a simple 
increment on the entire 20-bit value always causes the actual page number 
field to increment. The mask bits 937 are used again after an increment to 
set up the counter for any subsequent increments. 

The physical address is latched 936 whenever the MMU returns a 
valid physical page number after translation. The mask bits axv use d to 
correctly combine the returned phy 3 icalpage number with the original virtual 
address bits. 
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The physical address counter 938 is loaded from the physical address 
latch 936. It is incremented each time a word is returned from the PMC. 
The count is monitored as it increments, to determine whether the transaction 
is about to cross a page boundary. The mask bits are used to determine 
which bits of the counter should be used for the comparison. When the 
counter detects that there are two or less words remaining in the page, it 
signals the control logic 932, which the terminates the current PCI request 
after two more data transfers, and requests a new address translation if 

required. The counter is reloaded after the new address translation, and PCI 
requests resumed. 

The buret counter 939 is a 6-bit down counter which is loaded with the 
IBus burst value at the beginning of a transaction. It is decremented every 
time a word is returned from the PMC. When the counter value is two or less, 
it signals to the control logic 932, which can then terminate the PCI 
transaction correctly with two more data transfer (unless prefetching is 
enabled). 

The prefetch address regwter 943 is loaded with the physical address 
of the first word of any prefetch. When the subsequent ffius transaction 
starts, and the prefetch counter indicates that, at least one word was 
successfully prefetched, the first physical address of the transaction is 
compared to the value in the prefetch address latch. If it matched, the 
prefetch data ie used to satisfy the IBus transaction, and any PCI transaction 
requests start at the address after the last prefetched word. 

The prefetch counter 941 i» a four bit counter which is incremented 
whenever a word is returned by the PMC during a prefetch operation, up to a 
maximum count equal to the depth of the input FIFO. When the subsequent 
Bus transaction matches the prefetch address, the prefetch count is added to 
the address counter, and subtracted from the buret counter, so that PCI 
requests can start at the required location. Alternatively, if the IBus 
transaction only requires some of the prefetched data, the requested burst 
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length is subtracted from the prefetch count , and added to the latched prefetch 
address, and the remaining prefetch data is retained to satisfy further 
requests. 

The Data FIFO 944 is a 8 word by 33 bit asynchronous fall through 
- FIFO. Data from the PMC is written into the FIFO, along with a bit 
indicating whether the data i s part of a prefetch. Data from the front of the 
FIFO is read out and driven onto the IBus as soon as it becomes available. 
The logic that generates the data read signals operates synchronously to elk, 
and generates the JBus acknowledge output. If the transaction is to be 
satisfied using prefetched data, signals from the control logic tell the read 
logic how many words of prefetched data should be read out of the FIFO. 

Fig. 156 illustrates the structure of the RBus Receiver 914 of Fig. 150. 
Control is split between two state machines 950, 951. The Write state 
machine 951 controls the interface to the RBus. The input address 752 is 
latched at the start of an RBus burst. Each data word of the burst is written 
in a FTFO 754, along with its byte enables. If the FIFO 954 become full r- 
ready is deasserted by the write logic 951 to prevent the results organiser 
from attempting to write any more words. 

The write logic 951 notifies the main state machine 950 of the start of 
an RBus burst via a resynchronized start signal to prevent the results 
organizer from trying to write any more words. The top address bits, which 
form the virtual page number, are loaded into a counter 957. The virtual 
page number is passed to the MMU, which returns a physical page number 
958. The physical page number and the lower bits of the virtual address are 
rccombincd according to the mask, and loaded into a counter 960, to provide 
the address for PCI requests to the PMC. Data and byte enables for each 
word of the PCI request are clocked out of the FIFO 954 by the main control 
logic 950, which also handles all PMCM interface control signals. The main 
state machine indicates that it is active via a busy signal, which is 
resynchronized and returned to the write state machine. 
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The write state machine 951 detects the end of an RBus burst using r- 
final. It stops loading data into the FIFO 954. and signals the main state 
machine that the RBus burst has finished. The main state machine 
continues the PCI requests until the Data FIFO has been emptied. It then 
deaeserte busy, allowing the write state machine to start the next RBus 
buret. 

Returning to Fig. 150, the memory management unit 915 is 
responsible for translating virtual page numbers into physical page numbers 
for the Bus driver (IBD) S12 and the RBus receiver (IBR) 914. Turning to 
Fig. 157, there is illustrated the memory management unit in further detail. 
A 16 entry translation lookaside buffer (TLB) 970 takes its inputs from, and 
drives its outputs to, the TLB address logic 971. The TLB control Logic 972, 
which contains a state machine, receives a request, buffered in the TLB 
address logic, from the RBR or IBD. It selects the source of the inputs, and 
selects the operation to be performed by the TLB. Valid TLB operations are 
compare, invalidate, invalidate all, write and read. Sources of TLB input 
addresses are the IBD and RBR interfaces (for compare operations), the page 
table entry buffer 974 (for TLB miss services) or registers within the TLB 
address logic. The TLB returns the status of each operation to the TLB 
control logic. Physical page numbers from successful compare operations are 
driven back to the IBD and RBR. The TLB maintains a record of its least 
recently used (LRU) location, which is available to the TLB address logic for 
use as a location for write operations. 

When a compare operations fails, the TLB control logic 972 signals 
the page table access control logic 976 to start a PCI request. The page table 
address generator 977 generates the PCI address based on the virtual page 
number, using its internal page table pointer registers. Data returned from 
the PCI request is latched in the page table entry buffer 974. When a page 
table entry that matches the required virtual address is found, the physical 
page number is driven to the TLB address logic 977 and the page table access 
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control logic 976 signals that the page table access is complete. The TLB 
control logic 972 then writes the new entry into the TLB, and retries the 
compare operation. 

Register signals to and from the SCI are asynchronized 980 in both 
directions. The signals go to and from all other submodules. A module 
memory interface 981 decodes access from the Standard CBus Interface to the 
TLB and page table pointer memory elements. TLB access are read only, 

and use the TLB control logic to obtain the data. The page table pointers are 

read/write, and are accessed directly by the module memory interface. These 

paths also contain synchronization circuits. 

3.18.11 Peripheral Interface Controller 

Turning now to Fig. 158, there is illustrated one form of peripheral 

interface controller (PIC) 237 of Fig. 2 in more detail. The PIC 237 works in 

one of a number of modes to transfer data to or from an external peripheral 

device. The basic modes axe: 

1) Video output mode, m this mode, data is transferred to a 
peripheral under the control of an external video clock and clock/data enables. 
The PIC 237 drives output clock and clock enable signs with the required 
timing with respect to the output data. 

2) Video input mode. In this mode, data is transferred from a 
peripheral under the control of an external video clock and data enable. 

3> Centronics mode. This mode transfers data to and from the 
peripheral according to the standard protocol defined in IEEE 1284 standard. 

The PIC 237 decouples the protocol of the external interface from the 
internal data sources or destination in accordance with requirements. 
Internal data sources write data into a single stream of output data, which is 
then transferred to the external peripheral according to the selected mode. 
Similarly, all data from an external peripheral is written into a single input 
data stream, which is available to satisfy a requested transaction to either of 
the possible internal data destinations. 
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TheTe are three possible sources of output data: the LMC 236 (which 
uses the ABus), the RO 249 (which uses the RBus), and the global CBus. 
The PTC 237 responds to transactions from these data sources one at a time - 
a complete transaction is completed from one source before another source is 
considered. In general, only one source of data should be active at any time. 
If more than one source is active, they are served with the following priority - 
CBus, then ABus, then RBus. 

As usual, the module operates under the control of the standard CBus 
interface 990 which includes the PIC's internal registers. 

Further, a CBus data interface 992 is provided for accessing and 
controlling peripheral devices via the co-processor 224. An ABus interface 
991 is also provided for handling memory interactions with the local memory 
controller. Both the ABus interface 991 and CBus data interface 992 in 
addition to the result organizer 249 send data to an output data path 993 
which includes a byte - wide FIFO. Access to the output data path is 
controlled by an arbiter which keeps track of which source has priority or 
ownership of the output stream. The output data path in turn interfaces 
with a video output controller 994 and Centronics control 997 depending on 
which of these is enabled. Each of the modules 994, 997 reads one byte at a 
time from the output data path's internal FIFO. The Centronics controller 
997 implements the Centronics data interfacing standard for controlling 
peripheral devices. The video output controller includes logic to control 
output pads according to the desired video output protocols. Similarly, a 
video input controller 998 includes logic to control any implemented video 
input standard. The video input controller 998 outputs to an input data path 
unit 999 which again comprizes a byte wide input FIFO with data being 
written into the FIFO asynchronously, one byte at a time, by either the video 
input controller 998 or Centronics controller 997. 



-505- 



164 - 69 



A data timer 996 contains various counters utilized to monitor the 
current state of FIFO's within output data paths 993 and input data path 



999. 



It can be seen from the foregoing that the co-processor can be utilized 
to execute dual streams of instructions for the creation of multiple images or 
multiple portions of a single image simultaneously. Hence, a primary 
instruction stream can be utilized to derive an output image for a current page 
while a secondary instruction stream can be utilized, duringthose times when 
the primary instruction stream is idle, to begin the rendering of a subsequent 
page. Hence, in a standard mode of operation, the image for a current page is 
rendered and then compressed utilising the JPEG coder 241. When it is 
required to print out the image, the co-processor 241 decompresses the JPEG 
encoded image, again utilising the JPEG coder 241. During those idle times 
when no further portions of the JPEG decoded image are required by an output 
device, instructions can be carried out for the compositing of a subsequent 
page or band. This process generally accelerates the rate at which images 
are produced due to the overlap operating of the co-processor. In particular, 
the coprocessor 224 can be utilized to substantial benefit in the speeding up 
of image processing operations for printing out by a printer attached to the co- 
processor such that rendering speeds will be substantially increased. 

It will be evident from the foregoing that discussion of the preferred 
embodiment refers to only one form of implementation of the invention and 
modifications, obvious to those skilled in the art, can be made thereto without 
departing from the scope of the invention. 
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Appendix A: 

The Coprocessor Microprogramming 

This section del aits the aciions DCrfcrm"H i'm«» ma in- »u 

ins.n.ctionis executed. ^^S^S^^^^S^. ^ 
-on execution b pcrfonned via intcnuu register reads a* wrto "o£ST2 n " 
esso, can be fully microprogrammed via the Extern! Cbus SSJ^E h 
rata .metfacc Note that mic topping vlEX SSS S. « ^'v^ 
difficult due to the problems of synchronising the host y Cty 

In this Section ii is assured that the reader is familiar with the coprocessor's: 
i- execution model, 

2. instruction set and its coding, 

3. register set, 
4- and internal structure,. 

A.1 General Notes 
A.1.1 General CoprocessorSetup Principles 

For all instructions other than Control Instructions and Local DMA Instructions the data 
flow through the coprocessor is fundamentally underthc control of the Pixel oSfctt S 

flow of data and for determining when the final piece of data has been fetched S 
speaking, the other modules within the coprocessor simp.y ^TtJt^o*?^ 

i 

A.1 .2 Module Configuration Order 

SIS' T^" ™ " P f ° r CVCty ins,n, « ion - Some modules ,*ver configured in the 
course of ins.rucoon decoding. The order that modules are configured is always u7eTa™ 
and .S IX). DCC. OOB. OOC. MDP. JC. RO. PIC. * 

A.1 .3 Setting miscellaneous registers 

If an instruction is encoded to include the setting of a register value. .hen that twister is set 
donng microprogramming in sequence defined by the following rules: 

II^^ 8 '^ ,0 * M " u in 3 In0dule *" wouId not have any registers set 

UtentJutte^sterissetpiiortoatTyotberregistersettinss 
2. If the register to be set is in a module thai will have otter registers set. men the extra re°- 
•stcr is set afterother rcgisieis tat immediately prior to that module's _crg register. " 

A.1 .4 Inconsistent Instruction Operand Coding 

Many instructions have implied data types for operands and results, with other data types 
producing meaningless output. For each of the operands, the coprocessor determines dte 
intended format using the following procedure: "ermines rnc 

1. if the internal format of the operand is specified to be one of pixels, packed bytes or 
unpacked bytes, then the corresponding OpcraM Organiser will be set up to reflect this 

Wi " " 0t * COn/52ured and wi " ,hus cominuc to °P<«te in 

2. if the internal format of the operand is specified to be "other" then the coprocessor will 

» stnught forward For Operand A there is no naturally impHed "other" foS atS' *e 
coprocessor s behav.our is undefined. The corresponding Operand Organiser will be | c ft 
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Hi 

Microprogran«run 2 is reasonably ortho £ onal m taweeil Jle vanous modulejt 

A.1.5 Pseudocode syntax 

• The sequence of operations is eiven by the numlyrr ;„ i r 

• Register names are in hetvetic; boKf/ ^ ,eft mUlpn - 

* Register fields are denoted registej.fieid 

* I, D -Instruction word and Data word i«rwt,\..i., e- 

ex - the coprocessor lister number X 

- Cbus(X)- = do CBus operatiouX 

x - virtual memory location X 
• = unknown value, yet to be determined 

- "set* refers to the setting of the Data Manipulation Registers. 

A.2 Compositing Operators 

Notes 

»- Major opcode «QxC and OxD 

2. Opacity is considered to be the byte of highest address i p \< .h* « ♦ • 

3. ate a« aerator or tne opeLd nfggSS^i' ** "** b > ,e - 

4. Result may be un-pre-muldplied 

5. instruction length is defined in terms of the numbcrof input pixels 

Pixel Organiser: 
if I.R-0 then 
1- pojen <- OxOOOo : (.length 

endir 

if A_descriptor.S==o then 

2. po__dmr <- set 
endif 

3. po_said <- R 

4 - po_clg.mode <- sequential - c ^ 

. po.cfg.dDt <- mdp " SCt 9 ° ln 9 

DCC in «normal32* mode (default) 

operand Organiser B: 
if B.descrfptor.w other then not blend 

if B_descriptor.S = 0 then 

oob_dmr < - s^t 
endif 

7- oob_sa!d <- a 

if B_.descriptor.what - tile SpCCldl for compositing 

8 - oobjlfo <- B 
endif 
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9. 



10 



11 



IP 



oub_cfg.operatc <- operate 
endif 



- Operand Organiser c 
- There i s no implied ether date cype for Lt - is 
bo uced to specify any bit o£'-.^ i„ Vs«- 
if C_descriptor.il ■= oth~r then ™P- attenuate 



oocjen <- po_len 

set up the coc.drar 
if Cdescrltor.S « o then 

ooc__dmr <- set , „- 

endif - in =l««3i„g bo 

12. ooc_said <- c 

13. O0D_Cfg.operate <- operate 
endif 

Hain Data Path: 
if B_d BS criptor.lf - other then blend 

15. mdp_bm <- D ~ 

16. mdp_bl <- A 
end if 

17. mdp_cfg.tnstructfon <- I. opcode 
mdp_cfg.blendgen <- < B^deserlptor.if - other) 
mdp_dg.decode <- 1 

md P_cfgjoperat9 <- l 

Result Organiser: 
, " if R_dcsorltpor. 6 - o then 

18. ro_dmr <- set 
endlf 

19. rojsa R 

20. ro cfy.mode <- sequential 



A.3 Colour Space Conversion 

Motes 

2 - f °™' " m °" m ^ —"»<«» - »p« «-« „ r„* 

Pixel Organiser: 

" r^ri™*^ 5CnSe 33 >W ~ PlX — Whin. e lse 
if I.R « 0 then 

1. pojen <- OxOPOOrl.len 
endif 

if A_descripior.S = 0 then 

2. po_dmr <- set 
endif 

3. po__sald <- a 

4. po muv < - c 

c " ■ _ ~ " I & F rabies 

5 * P 0 _c1g mode <- esc 

po.cfg.dst <- mdp 
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lift 



" DCC setup: 

should be or.h^r 
if Djiescriptor.if « other then 
6- dec addr <- B 

dcc_cf 9 2xachc_missjn3t <- b descrjptor.C 

if LM • 0 then , 

. . . . single output channel 

dc C _cfga.mode <- single chan--=l CSC 

else 

dcc_cfg2.modc <- multi channel CSC 
endir 
endif; 

Operand Organiser B: 
-- operand B should other, anything else Is probably 

wrong but do a_t anyway 
if B_deseriptor.lf i~ other then 
8. oob^lert <- po I en 

if B__descrIptor.S - 0 then 
9- oob_dmr <- set 

endif 

10. oob_safd <- B 

11. oob_cfg.operate <- operate 
endif 

Operand Organiser C: 

Operand C should be other, anything else is probably 
vrong but do it anyway * 
if C_descriptor.lt i« other then 

12. oocjen <- pojen 

if C_descriptor.S = o then 

13. ooo_drnr <- set 
endif 

14. ooc_said <- C 

15. t>oc_cfg.operate <c- operate 
endif 

- - Main Data Path ; 

16. mdp_cfgjnstruct1on <- I. opcode 
mdp_cfg.decode <- i 
mdp_cfg .operate <- \ 

— Result Organiser: 

if R_descriptor.S - 0 then 

17. ro_dmr <- set 
endif 

18. ro_sa <- R 

19. ro^cfg.rnode <- sequential 



A.4 JPEG Instructions 

Notes 

\. Opcode = 0x2 

2. Operand C may a register id set 

3. Options are many: 
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in 

• subsamplmg ornoi 

• filtering or noc 

• 1,3. or 4 scan 

C TUesc i nstr.caons rcly 0n , vcraI regf stcrs t0 ^ up p . or [o ^ . )jiricr jM ^ ^ 

A. 4.1 Decompression 



Noles 

i. Tne following regiMcn should be set prior to this instruction being executed- 

• ro_.dn output image dimensions nosier * «*cuiea. 

• ro_cut: output cut register 

■ rojmt: output limit register 

endif 

- * Pixel Organiser: 

".sr;;^ s the source compresced dat *< — - . 

if J.R = 0 then 

1. pojen <- Ox0O00:Uen 
endif 

if A_descripior.S = o then 

2. po_dmr <- set 
endif 

3. posaid < - A 

if there ia a register set targeted at £>o 
if (D.R - i) and (C^descriptor.topnibbte = 0x6) then 
**• eC_descriptor <- C 

endif 

5. po_cfg.mode c- sequential 
po_cfg.dst <- jc 

DCC setup: 
if B_descrlptor.lf = other then 
& dcc_addr c- B 

if there is a register set targeted at DCC 
it (D.R = i) and (0_de5criptor.topnibWe = 0x5) then 

^C descrlptor <- C 
endif 

6. dcc_cfg2.C <- B_descriptor.C 

dcc_cfg2.mode <- JPEG decode 
end if 

" operand Organiser B: 

-- °r^r d h % S i° Uld ^ 0thCr ' a " ything else is Probably 
vrong, but do it anyway 1 

if BjiescriptorJf f« other then 
9. oobjen <- polen 

if B_descrlptor.S = o then 
10- oob_dmr <- set 

endif 
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12. 



oob_said c - B 
oob_cfg.uperale <; - operate 

Operand Organiser C: 

- -- Operand c may be a register t* sot. If it i,n't it s h D ulc 

i"- aD J thlBB €l£S *» P*cb.bly wrong, but it anyw 

1, CR.D - 03 and < C_descriptc-r.fi != -her) -hen 

13. oocjon <- pojen 

if C_descriptor.S ^ 0 then 

14. ooejmr <- set 
endif 

15. ooo_sald <- C 

ooc^cfg.operate <- operate 
^-ndif 

JC setup: 

if tnere is a register set targeted at JC 
if (D.R = 1) and <C descriptor^ p nibble = 0x9) then 

16. ec_descrlptor <- C 
endif 

17. Jo_cfg.lnstruction <- I. opcode 
jc_ofg.decode <-. l 
jc_efg,op€ra1e <- l 

Result Organiser; 
if R_descriptor.S - O then 

18. ro_dmr <- set 
endi f 

-if there is a register v/rirc targeted at ro 

19. ro_sa <- R 

if (D.R « i) and (C_descriptor.top nibble = OxB) then 

20. ec_dcscrjpior <- C 
endif 

21. ro_cfg.mode <- jpeg 
ro_cfg.chan <- I.Ms 1.4 
ro_cfg,upsarnpla <- |.S 
ro cfg.cut <- |.C 
ro_cfg.]|mft <- I.T 

.4.2 Compression 



Notes 

1. The following registers should be set priono this insiruciion being executed- 

• po_idr: output irnaae dimensions register 

• jP_rml: restart marker interval 

• ro cut: outpu! cut register 

• rojrnt; output limit register 

It 13 a re5ister set ««3««d at anything other than 

the JC, PO, RO or DCC 

if (D.R 3) ancl (C_descrlptOUopnlbt3le !« 0x5,0x6,0x9 or OxB) then 

eC_descriptor C 
endif 

Pixel Organiser: ( 
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Operand A is the source compressed data, and should be ^ 
nytr? stream a 

if t,R - 0 then 
1. po Jen <- OxCOOO : Lien 

«ndif 

if A_dQSCriptor.S - 0 rhen 
2- po_dmr < - set 

entfif 

3. po said <- A 

-- if there is a register set targeted at PO 
if (D.R - i) and (C_descrlplor\lopnlbWe = 0x6 ) then 

4. OC^descrlptor <- c 
endif 

5. po_cfg.mode <- jpeg 
po_cfg.dst <- jc 

po_c|g.SS <- |.8 - subsompling 

pp^cfg.chan <- !.M;M 
po_cfg.F <- LF 

' - DCC setup: 

if B_descrIptor.lf = other then 

6. dcc_addr <- B 

if there is a register set targeted at DCC 
if (D.R - 1) and (C_descriptor.topnfbble = OkS) then 
7- @C_descriptor <- C 

endif 

8. dcc_cfg2.cacho_miccJnst <- B^descriptor.C 
doo_ofg2.mode <- J PEG encode 

end i f 

-- Operand Organiser B: 

Operand B should bs other, anything else is probably 

vrong, but do it anyvay 
if B_descrlptorJl ! = other then 

9. oobjen <- pojen 

if B_descriptor.S - 0 then 

10. oob_dmr <~ set 
endif 

11. oob_safd <- B 

12. oob_cfg. operate <- operate 
endif 

Operand Organiser C: 

Operand c may be a register to set. If it isn't it should 
--be other, anything else is probably vrong. but do it 

- - anyway 

if (R.D » 0) and (C_descriptor.lt !- other) then 

13. ooc Jen <- pojen 

if C_descrlptor_S - 0 then 

14. ooe dmr <- set 
endif 

15. ooc^sald < - C 

16. ooo_cfg.operate < operate 
endi f 

- - JPEG coder : 

if there is a register set targeted at JC 
if (D.R - I) and ( C^descrlplor.topfifbbJe = 0x9 j then 
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"* 7 - t'C_descrlptor <;- C 

endif 

18. f c _ cfgJnstructjon <- I. opcode 
J c_cfg.doc ode < i 
jc_cfg.opern1e I 



Result Organiser: 
if R_dc3crip1or.S - 0 then 

ro dmr <- set 
r-ndif 
20. ro sg <- R 

~l ^nJ 1 ^ 6 1S * lister write targeted at ro 
?1 1 " a> < C - d6scrl P to r.topnibbre - OxB) then 

endif 

22. ro._cfg.modc <- sequential 
ro_cfg.cut <- l.c 
ro_cfg.fimJf <- | # T 



A.5 Data Coding 

Motes 

U opcrat r iom » whclhcr wn»n- or decompressing am dealt with in the 

same way, Setting up for these operations Jooks very srnUarto that for JPEG 

2. Possible encoding operations arc: 

• huff man encode 

• predictive encode 

3. Possible decoding operations are: 

• fast huffman decode 

• slow hufirnan decode 

• packbits decodi ng <versi on A) 

• packbks decoding (version B) 

• predictive decode 

4* Operand C may be a register to set 

^ ^^^^^^^^'^^'^ 

• fojmt: output limit register 

0. " icl:Jp X0 rt , c C - <,eSCript "- ,0pnlbble ! " 0X5.0X6,0,9 or OxB) then 
endif 

- - Pixel Organiser: 

Operand A is rhe source data 
if I.R = 0 then 

1. pojen <- 0x0000: Men 
end \ f 

if A_dascriptor.S = 0 then 

2. po.dmr <- set 
endi f 

3. po_sald <- A 

if thexe is a register set targeted at PO 
if (D.R - i| and C C_d esc rlptor.topnl bole - o x &) then 

4. eCj_escrfptor <- C ? 
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endif 

5. po_cfg.modn <- sequential 
poclg.dst <- jc 

" " DCC Setup; 

if B_descriptar.if = other then 

6. dcc_addr <- B 

•> . if theie is a register set targeted at DCC 

if (D.R = l) and ( C^dcscrlpior.top nibble « 0x5) then 
7 « ec_desodptor <- C 

endif 

8. dcc_cfg2.cache missjnst B_descrlptor,C 
dcc_Cfg2.mode <- en/dc coding 

else 

if there is a register set targeted at DCC 
if (D.R = 1) and ( C_descrl ptor.topntoble = 0x5) then 

G (^descriptor <- C 
endif 

dcc_cfg2.cache_missjnst <- B_descriptor.C 
endif ^ 

Operand Organiser B : 

operand B should be other, anything else is probably 
wrong, but do it anyway 
if B_descriptor.it ! other then 

9. oob_1en < - pojen 

if B_descrlptor.S = 0 then 
10- oob_dmr <- set 

endif 

11. oobsaid <- B ' 

12. oob_cfg operate <- operate 
endif 

Operand Organiser C: 
- Operand C may be a register to set. If it isn't it should 
be other, anything else is probably wrong, ' but do it 
— anyway 

if (R.D = 0) and ( C_descrip1orjf !« oth*r) then 

13. oocjen <- pojen 

if C_descriptor.S = 0 then 

14. oocjdmr <- set 
endif 

15. ooc_5ald <;- C 

16. ooc_cfg .operate <- operate 
endif 

-- JPEG Coder: 

--•if there is a register set targeted at jc 
if (0.R - l) and < C_descriptor.topnibble = 0x9) then 

17. <?C_descrfptor <- C 
endi f 

18. jeef ^instruction <- I. opcode 
Jc_cfg.decode <- i 
)c_cfg.operat© <- 1 

-- Result Organiser: 
if R_descriptor.S - 0 then 

19. ro_dmr <- set 
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enriif 
20, ro^sa <- R 

if there is a register write targeted at RO 
( D.n X) and ( C_dcscrjpior.topnfhble - OxE) then 
8C_dcscriptor < C 
endif 

22. ro„cfg,rnode <- sequential 
r °_cTg.cut <- 1.C 
ro_cfg.limil <- LT 



A. 6 Transformations and Convolutions 



Notes 

1 - Opcode - 0x4 (convolutions) and 0x5 (craraformations) 

2. the coprocessor performs an operation which is a superset of what i s required for each of 

^/Z^T ^ Co r luUOnS * AS far 85 Ihfi Wow h con^d the 
ad y difference between a naasformtion and a convolution is rha for a naiuformation 
15l 5t£ P Size Cho^onmJIy and vertically) is the si* of the kernel. wheSor a 
convolution, the srep size is one source pixel. a 

3. Options are: 

• iiuerpolaiing or snapping- to-closea pixels 

• pixel accumulation Ci.e, a kernel) or not 

• pre-multiplication or not of the source pixels 

• clamp, wrap or absolute value for determinins final result 

4 * tm f himmltx f * ld convoluUons'don-t work in place, U. if the source and 

destination pointers are the same it will break. 

-- Pixel organiser: 

Operand A in the kernel descriptor and the PO delivers 
kernel co-efficients to the KDP. This is coded as 
"other" 

L = 1 long format 
0 short format 
if LR - 0 then 

pojen <- oxoono:i.ien 

endif 

if A_descrlptor.S = 0 then 

po_dmr <- set 
endif 

po_said <- A 

poelg.mode <- con volution/ transformation 
po_cfg.dst <- mdp 

- - DCC setup: 

the implied data type here is the source image 
if B_descrlptor.lf - other then 

dcc_cfg2.cache_mj$sjnst <- B_dcscriptor.G 

if LS = o then 

dcc_cfrj2,mode <• 64 bit mode 

else 

dcc_cf02.rnode <- random mode 
endif 
else 

dcc_dg2.cache miss inst <- B descriptor C 
endif 
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Operand Organiser B: 

oi\2 P ^ rand ? ?f a pointer ro source image (other |, anythinu 
else is probably wrong, bur do it anyway 7 ny 

if S_descriptor.il {« other then 
6- oobjen <- pojen 

if B_descripior.S fc 0 then 
7. oob_dmr <- set 

endif 
0- oub said <- B 

9. oob_cfg. operate <- operare 
end it 

- - Operand Organiser C: 

operand C descriptor is borrowed by the r.c field 
--CO COL" is not set up 

Main Data Path: 

— kernel descriptor is delivered to mdp via po 

10. mdp Jen <- pojen 

11. mdpj>m.rows <- C_descrlptorj 
mdpJ>m.cols <- Cdescrlptor.c 

12. mdp_cfg.rnstructlon <- I. opcode 
mdp_cfg.long_krnJ <- A^deseriptor.L 
mdp_cfg.decode <- i 
mdp_cfg.opera|e <- 1 

Result Organiser: 
-- result will be pixels or part thereof 
if R_descriptor.S - 0 then 

13. ro_dmr <- set 
endif 

14. ro sa <- R 

15. ro_cfg.mode <- sequential 



A.7 Matrix Multiplication 



Notes 

]. Opcode = 0x3 
1. Options are: 

• pre-rnulriplicaiion or not of the source pixels 

- clamp, wrap or absolute value for determining final result 

• Operand C may be a register write 

if there is a register set targeted at anything other than 

the MDP, PO, RO or DCC 
if (D.R = l) and ( C_descrlptoUopnibbl9 •= 0x5,0x6, OxA or OxB) then 

<sC_descriptor <- C 
endif 

- - Pixel Organiser : 

Operand A i s the source pixels, only makes sense to have 
vhoLe pixels, anything else is probably vrono 

if in = 0 then 

pojen 0x0000; Lien 

endif 

if A_descriptor.S - 0 then 



-517- 



176 



2- po_dmr <- seL 

endiC 
3. po^said <- A 

if there is a register set tdjcsled at PO 

it CD-R - 1) sod (C_descriptor.topn«bble « 0x6) then 
4- eC_descriptor <- C 

endif 

5. F*_olg.modc <- sequential 
po_o1g,dst < mdn 

DCC setup: 

" the implied other data type is a matrix of coefficients 
ir B_descf1ptor.lt - other then 

6. dcc_addr <- B 
end if 

if there is a register write targeted at DCC 
if <D.R = 1) and (^descriptor, top nibble - OxS) then 

7. SCdescrJptor <- 0 
endif 

if B_descrfptorJf - other 

8. doo_pfg2.cache_mi33_Jn3t <- B^descriptor.C 
dcc_cfg2.mode <- matrix multiply 

else 

dcc_cfg2.cache__mlssjnst <- B descriptors 
endir 

Operand Organiser B: 

Operand B is the matrix co-ef ficients (other), 
anything else is probably vrone, but do it an way 
if B_dBscriptor.ff != other then 

9. oobjen <- pojen 

if B w descriptor»S - o then 

10. oob^dmr <- set 
endif 

11. oob said <- B 



12, 



16. 



oob_cfg.operate <- operate 
endif 



Operand Organiser C: 

operand c may be a register to set. If it isn't it should 
— be other, anything else is probably vrong hut ao it anyvav 
if (RJ5 « oj and (C_descrIptor.il !- other) then 

13. ooc_len <- po_len 

if c_descrIptor.S = 0 then 

14. ooo_dmr <- set 
endif 

15. ooc_said <- C 



ooc_cfg.operate <- operate 
endif 



KDP setup : 

-- if there is a register set targeted at KDP 
if (D.R = i) ana (C descriptar.topnlbble = Oxa) then 

17. SO descriptor <- C 
endir 

18. mdp_cfg. Instruction <- I. opcode 
mdp__cfg. decode <- 1 
mdp_cfg.operate <~ 1 



-518- 



Result Organiser: 
j f R_descrip1or.S = 0 then 

19. ro_dmr <- set 
endif 

20. ro_sa <- fl 

it there is a register write targeted at ro 
if (D.R = 1) and ( C_descrlptor.topnfbbJe - OxB) then 

21. ^descriptor <- C 
endif 

22. ro_ctg.made <- sequential 

A.8 Halftoning 

Notes 

1. Opcode = 0x7 

Z Only option is the number of levels of halftoning 

3. can be done on pixels or bytes as long as the halftone screen is appropriately umncshed 
or meshed. 

Pixel Organiser: 

A operand is the source pixels or bytes 
--" PO operates in conventional secuential mode 
if I.R = 0 then 

1. pojen <- 0x0OO0:Uen 
endif 

if A_descrtptor.S - 0 then 

2. po_dmr <- set 
endif 

3. po_said < - A 

4. po_cfg.rnode <- sequential 
po_cfg.dst c- mdp 

DCC setup: 

no implied other data type 
--if there is a register vrite targeted at DCC 
if (D.R - i) and (O.descrlpior.topnlbDle = 0x5) then 

5. <SC_descrfptor <- C 
endif 

if B_descriptor.C ^ o then 

6. dcc_cfg2xache_mfss inst <- B descriptors 
endif 

Operand Organiser B: 

Operand B is the halftone screen, and should be a tile 
--anything else is probably wrong . There is no sensible 

implied "other" data type for halftoning, 
if B_descrlptorJf ! = other then 

7. oobjen < pojen 

i t 0_descriptor.S - 0 then 

8. oob_dmr <- set 
endif 

9. oot>_said <- B 

if R_riescriptor.what = tile 

10. oobjile <- C 

endif t 
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. m 

oob_cfg.operate <- operate 
endif 

Operand Organiser C: 
- Operand C word is borrowed by - ne tiled Operand 0 
The C_descriptor is cempletelv ignored 
-- OOC is in random .-node (default) and vont be touched 

Main Data Path: 

12. mdpbmJevel <- Llevels 

13. rndp cfg.lnstructcon <- I. opcode 
rndp_cfg,decode <- 1 
nidp_clg.operale <- l 

*- Result Organiser: 
if R_descrlptor.3 = o then 

14. ro_dmr <- set 
endif 

15. ro_sa <- R 

16. ro_cfg.mode <- sequential 

A.9 Memory Copy 

Notes 
- 1. Opcode = 0x9 

2. TOs instruction uses two quite distinct mechstsms io achieve a memory copy operation- 

• A General Data Transfer utilises the normal data Row path through the coprocessor and 
so can utilise the various functions associate vith the data manipulation units in the PO 
and RO. 

* A Peripheral DMA utilises a direct connection between tic PIC and the LMC This means 

that no data manipulation can be perfouned, and that this operation may be ovcriapoed 
with subsequent instructions. r * 

A.9.1 General Data Transfer 

" Pixel. Organiser : 

-- A operand is the source c=ta 

PO operates in ccnvention-1 sequential mode 
it I.R - o then 

1. pojen <- 0x0000:1. (en 
endif 

if A_descrIptoi.S = 0 trien 

2. po_dmr c- B 
end if 

3. po_said <- A 

4 - po^cfg.mode <- sequential 
po_cfg.dst <- j c 

-- Operand Organiser B : 

Operand B word represents a value to put in po_dmr 

Operand Organiser C: 
-- Operand c word represents the value to put in the ro_dmi 

- ■ JPEG Coder : 
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5 - i<;_cfg.lnslruction <- I . opcode 
jc_cfg.1bo <- D.bo 
jc_cfg.ono <- l.obo 
jc_cfg.decod© <- 1 
jc_cfg.operale <- 1 

Result Organiser: 
if R_des crip tor. S = 0 then 

6. ro_dmr <- C 
end 

7. roja < - R 

8. ro_cfg.mode sequential 

A.&2 Peripheral DMA 

Notes 

1 . May be overlapped or noL This is dealt with by the IC 

2. Operand C may be a register to set 

y This instruction is different from other "activity" instruction in that the PIC is the mod- 
ule that drives the data. 

only setup required is for the PIC: 
if I.S = 1 then - - eata in from Pic 

1. plcjibus_addr <- R 
else 

pic_abus_addr <- A 
endi f 

if there is a register write targeted anywhere 
if D.R = 1 then 

2. <?C_descrtptor <- c 
endif 

if LR = 0 then 

3. pic_abus_cfg.ab_count <- 0x00 : l.length 
endif 

p i c_a bus_c Jg. a bjbyx e_en <- l.nyte 
pic_abus_cfg.abjype <- I.R 
plc_abus_cfg. start o 1 
Pixel Organiser; 

Operand A is the source address given to the PIC 
The PO Ls not setup 

*- Operand Organiser B: 
OOB is not setup 

-- Operand Organiser C: 

- - OOC is not setup 

Kesult Organiser: 

The Result word represents the tarqet address 

- - RO is not set up 
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A.10 PhotoCD Decompression 

This family of instructions consists of uVee differeii operations; horizontal interpolation 
venicaJ interpolation and residual merging. As it happens, the setup for Vertical irv.erpola- 
tton and the scmp for residual merlin? are identic^. 

Opcode for all these instnictions is 0x9 

A. 10.1 Horizontal Interpolation 

Notes 

1. can operate on pixels or bytes 

2. this is a one operand instruction so Operand C may be a register to set 

if there is a register set targeted at anything other than 
-- the KDi\ PO or RO 

if [DM - and (C_descriptor.topnibble i= 0x6, OxA or OxB) then 

0. €C_descrlplor <- C 
endif 

- - Pixel Organiser: 
if in - 0 then 

1. pojen <- OxOOOOjIJen 
endif 

if A_desct[ptor.S = o then 

2. po_dmr <- set 
endif 

3. po_said <- A 

if there is a register ear targeted at t'O 
if (D.R - 1} and ( C_descrlpt or.top nibble - 0x6} then 

4. GCjtescriptor <- C 
endif 

5. po_efg.rnode <- sequential 
po_cfg.dst <- mdp 

Operand Organiser a: 

Operand B word is completely ignored 
The B_desoriptor is completely ignored 

Operand Organiser C: 

Operand c m&y be a register to set up 
Mnin Dara Path: 

if there is a register set targeted at MDP 
if (D.R = i) and (C_descriptor.topnlbble * 0xA> then 

6. SC^descrlptor <- C 
endif 

7. mdp_cJg.instruction <- L opcode 

if A_descriptorJf /= packed bytes then— tell radp about s^e 

mdp^cfg.wordjnput <- l 
else 

rndp_cfg.word_tnput <- 0 
endif 

mdp cfg.docodG <- 1 
mdp_cfg. operate <- 1 

Result Organiser: < 
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ISI 

i t R descriptor^ = o then 

8. rodmr <- set 
endif 

9. ro_sa <- R 

10. ro^ctg.mode <- sequenLial 



A.10.2 Vertical IntQrpolation and Residua! Merging 

Notes 

L The setup for Vertical Interpolation and Residual Mcrein* is identical. 

2. Can operate on either bytes or pixels 

3. This is a two operand instruction so Operand C may be a register set 

H a re9ister SCt targeted at anything other than 

the MDP, PO, OOB or RO 

if (D.R - 1) and (Cjiescriptor.topnJbble != 0x6,0x7.0xA or OxB> then 

0. <?c_descriptor <- C ; 
endif 

Pixel Organiser: 
if I.R - 0 then 

1. po Jen <- 0x0000: Lien 
endif 

if A_.descriptor.S - 0 then 

2. po_drnr <- set 
endif 

3. po_said <- A 

if there is a register set targeted at po 
if [D.R = i> and (C_descrlptor.topnibble = 0jc6> then 

4. eC_descrtptor <- C 
endif 

5. po_cfg.rnode <- sequential 
po_cfg.dst <- mdp 

- - DCC setup : 

--no implied other data type 

-- if there is a register write targeted at DCC 
if (D.H - 1) and (C_de5criplor.topnlbWe = 0x5) then 

6. GCdescrlptor <- C 
endif 

LC B_descriptor.C =* o then 

dcc_cfg2.cache_miss_inst <- B_d«rc;criptor.C 
endif 

-- Operand Organiser B : 

-- Operand B is the residuals or one operand for 

interpolation, anything else is probably wrong, but 

- - do it anyway 

-■ There is no sensible implied "other* data type for 
-- this. 

if B_descriptor.iT ! = other then 

7. oobjen <- po ten 

if B_d«scrlptor.S - o then 

oob dmr <- 5 eh 
enriS f 

0. oob ^ald <- B 
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endif 

if there is a register sot tarc^ed at ODB 
if [D.R = 1) arid (C_descriptortopnibble = 0x6) then 
1U - ec^dcscripior < c 

endif 

11- oob_cfg.operale <- cceri:$ 

endif 

Operand Organiser C: 
may be a register to set 

- - Main Data Path: 

.-- if there is a register set taree- e d at MDP 

» lf 1°^ = 1} ( C - descr ^r.topnibb!e = OxA) then 

iZ- 8C_deserfptor <- C 

endif 

13. mdp_cfg.inst ruction <- I . opcode 

if A descriptor.]* /» packed.bytes then- tell md P about 

^P_cfg.vyord_tnput <- l 
else 

rndp^clg.wordjnput <- o 
endif 

™dp_crg.decode <:- 1 
indpcfg. operate <- 1 

Result Organiser: 
if n_deacriptor.S *- 0 then 

14. ro_dmr <- set 
endif 

15. ro_sa "<* R ' 

16. ro_cfg.mode <- 0 sequential 

A,11 Control Instructions 



Notes 



I. Control Instructions consist of two classes of operations: Row Control Instructions and 
Internal Access Instructions. 



A.11.1 Flow Control 



Notes 

1. Opcode = 0xB 

2. Flow Control Instructions currently consist of Jumps of various kinds and Waits of van- 
ous kinds. 

3. No explicit setup is done in the coprocessor, and this instruction is not an "activity" 
instruction, ic no the coprocessor subrnodules other than the instruction controller actu- 
ally do anything. 

4. Operand C may be a register to set 

only thing to do is : 
if D.R - l then 
0. eC_descr!ptor <- C 

endif 

-- Pixel Organiser: 
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*- no setup associated with Operand A 

Operand Organiser 5: 

no setup associated with Operand b 

Operand. Organise: C: 

no st;Lup associated with Operand C 

Result Organiser: 

no setup associated vith the Result 

A.11.2 fnternal Access: Read 



Notes 

1. Opcode = (kA 

2. A "read" i t a transfer of data out of the coprocessor 

3. Tne only module in the coprocessor that actually does anything For this istneRO 

Pixel Organiser: 
-- no setup associated with Operand K 

Operand Organiser B; 
--no setup associated with Operand B 

Operand Organiser C; 

no setup associated with Operand C 

Result Organiser , 

1. ro_subst <- Cbus(A) 

2. ro_sa B 

3. ro_cfg.mode <- Cbus_data 



A.11.3 Internal Access: Write 
Notes 

1. Opcode = OxA 

2. A "write" is a transfer of dcta into the coprocessor. 

3. TOs iistniction is not an "activity instruction as no modules other than the IC actuaJ] y 
flo anything. J 

1. Cbus|A) <- a 

Pixel Organiser: 

no setup associated v/ith Operand A 

-- operand Organiser F1: 

no setup associated with Operand B 

Operand Organiser C: 

no setup associated with Operand C 

Result Organiser: 
- no setup associated vlth the Result 
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A.12 Reserved Instructions 

Notes 

1 . Opcodes 0x0, OxE and OxF arc "reserved" 

2. A <l ~«-—— - 

3. 



ui w i ^oui \ CVJ . 

'reserved" instruction will generate a (maskable) error 
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IBS' 

Appendx B: Registers 
1.1 Registers and Tables 

^ctST dCSCribeS C ° PTOCeSSOf re?iS,CrS - T:a$C regiS,erS Can * « one of 

1. Specific the coprocessor instructions exist for reading and writin« rensters Bv u d„. 
Jhese mictions these regis te re can be read or written using taSiSgS 

2. Many registers change value as a side effect of instruction execution. The primary rnech 

vanous registers to reflect Ok currently executing instruction. At the end of the instruc 
Qon, the agisters reflect the state of the coprocessor at that time. 
Most typical operations can be completely specified and set up by the one instruction. In 
some cases. It is reoessary * set some tenets immediately pL I executing SSuc- 

Semantics of "reserved" register bits 

Any registers or parts thereof that are "reserved" hsvs the following semantics: 

• a write to a reserved location will succeed and the data will be discarded 

• a read from a reserved location will IU cceed and will result in undefined value 
All unspecified registers and register fields are reserved. 

1.1.1. Register Taxonomy 

in this section. In these descriptions: 

• «te^>y -extend to mem by 

die Instruction controller or via tie sterna] Cbus Inrerface. Note that the re»i sters can't 
be «t via bit-set (Type C) mode from the PCIbus. ° 1 

• internally = internally to the module 

Status Register 

A status register is read-only externally and read/write internally. 
Configl Register 

A Configl register is read'wriie externally and read only internally. 

1*^1 ? °" ^ ^n^pt C Cbus operation, (ic docs not suppon bit *et 

zsszsz rcs,£ters ,hat hoid by,e <° r bisscr> **. -^-ion 

Config2 Register 

A Confi S 2 register is also read/write externally and iead only internally. 

A Configl register suppons Type C Cbus operations (ie supports bit set mode) and is used 
foresters that hold configuration information that may need to be set on a bit by bi! ha * 

Controll Register 

A Controll register Is readable/writable both internally and externally. 
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A Control 1 register does no! support Type C Cbus operations <ie docs not support bit sci 
mode) and is used for registers lha, hold byte (or bigger) ,ype control information suc |, as 
address values. a - 



1.1.2 



Control2 Register 

A Control register is readable/writable both internally anl externally. 

A Control register supports Type C Chus operations <ie supports bit sc. mode) and is us-d 
for registers that hold control information that may need to be set on a bit by hit basis. 

Interrupt Register 

The bit. within an , Interrupt register aic sellable to 1 imemally and rcsettabic to 0 externally 
by writing a 1 to that btL y 

Each of the module interrupt/error registers are of this lype. Each module's intermpt/crror 
register consists of three fields: 

17:0} represents any errvr conditions generated by the module 
{23:8] represents any exception conditions generated by the module 
{31 :24] represents any interrupt conditions generated by the module. 

Register Map 

Table l.i shows the coprocessors re*isten.The "number" inTablel.l is the number of the 
register rather titan a byte address, U1 uc 



Table 1.1 the coprocessor registers 




0x00 



0x01 



0x02 
0x03~ 



0x04 



0x05 



0x06 



0x07 
0x08 



ejc_cfg 



ac stat 



cic_errjrtt 



c*c_crr_int_cn 



eic_test 



elc_gen.pob 



eicJiisruaddr 



0x09 



OxOA 



eic_wtlb_v 



eic_wrlb_p 



eic rnmi! v 



Coni]g2 



Status 



Interrupt 



Config2 



ConfigZ 



ConfIg2 



Configl 



Control2 



Config2 



Status 



Confi juration 



Status 



Error and Inteirupt Status 



_Error and Interrupt Enable 



Tcjt modes 



Generic bus programmable output bits. 



Dual address cycle offset 



Virtual address and operation bits for TLB 
Invalidate/Write 



Physical address and control bits for TLB 
Write 



Most recent MMU virtual address translated, 
and current LRU location. 



OxOB 



OxOC 



OxOD 



eic_mmu_p 



Stains 



eicjp^addr 



Most recent page table physical address 
fetched by MMU. 



eic_jp_uddr 



Status Physical address for most recent IB us access 

to the PCI Bus. 

Physical address for most tecent RBus access" 
to the PCI Bus. 



Status 
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Table 1.1 the coprocessor registers 













OxOE 


eicjg_addr 


Status 


Address for most recent IBus access to tlx; 
Generic Bus, 




OxOF 


cic_rg_data 


Status 


Address for most recent RBus access to the 
Generic Bus. 






Local Memory Controller Registers 


page 32 


Ox 10 


lmLcfg 


Concro!2 


General configuration register 




OxU 


ImLsts 


Status 


Genera] status register 





0xl2 


IrnLerrJnt 


Interrupt 


Error and interrupt status register 




0x13 


ImLcrrJnL.cn 


Control2 


Error and interrupt enable register 




0x14 


tmi_dcfg 


ControlZ 


DRAM configuration register 




0x15 


IniLmodc 


Conirol2 


SDRAM mode register 




0x16 










0x17 










0x18 










0x39 










OxlA 










OxlB 










one 










OxlD 










Ox IE 










OxlF 












Peripheral Interface Controller Registers 


page 36 


0x20 


pic_cfg 


Config2 


Configuration 




0x21 


pic_stat 


Status 


Status 




0x22 


pic_eir_int 


Interrupt 


Inteirupt/Error Status 




0x23 


pic_enjnt_en 


Config2 


Interrupt/Error Enable 




0x24 


pic_abus_cfg 


Control2 


Configuration and control for ABus 




0x25 


pic_abus_addr 


Coring 1. 


the coprocessort address for ABus transfer 




0x26 


pic_cent_cfg 


Control2 


Configuration and control for Centronics 
mode. 




0x27 


pic_cenr_dir 


Config2 


Centronics pin direct control register 




0x28 


pic_rcverse_cfg 


Control 2 


Confi guration and conuol for reverse (input) 
data transfers 




0x29 










Ox2A 


picjimerO 


Config! 


Initial data timer value. 




0x2B 


pic.jimerl 


Cortfigl 


Subsequent data timer value. 














0x2D ! 










0x2E 










0x2F 












Miscellaneous Module Registers 


pages 


0x30 


miti_cf g 


Config2 


Configuration Register 
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• 1 the coprocessor registers 





> l ;.namo • 




- \ ^ > ^:L : "< description < ? 


: seepage 


0x3 1 


mm_siat 


Status 


Status Register 




0x32 


mrn_err_inL 


Interrupt 


Error and Irueirupi Register 




0x33 


jnm_err_ini_en 


ConTig2 


Error and Interrupt Masks 


— , 


0x34 


mm__gcfg 


Config2 


Global Configuration Register 




0x35 




Config 


i-vicjjiua>uu v^uiiiigurauOli Ke^ister 




0x36 


rnrn_grsi 


Config* 


Global Reset Register 




0x37 


mrn_gerr 


Config2 


Global Error Register 




0x38 


mrn_gexp 


Config2 


Global Exception Resis-er 


— 


0x39 


mrn_gint 


Config2 


Global Imemipt Register 




Ox3A 
0x3 B 


mm^active 


status 


Global Aciive signals 




0x3C 










0x3D 










0x3E 










0x3F 












In 


struction Controller Registers 
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0*40 


i<L_cfg 


Coiifjg2 


Configuration Register 




0x41 


ic_stat 


Status/ 
. Interrupt 


Status Register 




0x42 


ic_errjnt 


Interrupt 


Error and Interrupt Re.dster Ovrite to clear 
error and interrupt) 




0x43 


ic_errjnt w cn 


Config2 


Error and Iixerrupt Enable Register 


- 


0x44 


ic_ipa 


Control 1 


A stream Instruction Pointer 




0x45 


ic_tda 


Coiifigl 


A stream Todo Register 




0x46 


icjfna 


Conrroll 


A stream Finished Register 




0x47 


ic^inta 


Config l 


A stream Interrupt Register 




0x48 


icjoa 


Status 


A stream Last Overlapped Instruction 
Sequence number 




0x49 


icjpb 


Control 1 


B stream Instruction Pointer 




0x4A 


ic_tdb 


Configl 


B stream Todo Register 




0x4B 


ic_fnb 


Control 1 


B stream Finl shed Register 




0x4C 


4cjntb 


ConTigl 


B stream Interrupt Register 




0x4D 


icjob 


Status 


B stream Last Overlapped Instruction 
Sequence number 






ic_sema 


Status ; 


A stream Semaphore 




0X4F 


ic_scmb 


Status 


B stream Semaphore 






Data Cache Controller Registers 
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0x50 


dcc_cfgl 


config2 


DCC configuration I register 




0x51 


dcc_sut 


status 


state machine status bits 




0x52 


dcc_erMni 


status 


DCC error status register 




0x53 


dcc_crMnt_cn 


control 1 


DCC error interrupt enable bits 




0x54 


dcc_cfg2 


control 2 


DCC configuration 2 legister 
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the coprocessor registers 



iilliii 








, ' '•" ~- %•» v '■ 


0X55 
0x56 


Cec_arldr 
dec J vO 


conflgj 
control ! 


Base address register ;or special address 
modes. 

"valid" bit status for lines 0 to 33 




0x57 
Ox5S 
0x59 
0x5 A 
Ox5B 
Ox5C 
Ox5D 
Ox5E 
Ox5F 

0x60 
0x61 
0X62 
0x63 
0x64 
0x65 
0x66 


dccjvl 

<JccJv2 

dccjv3 

dcc_jaddrb 

dcc_raddrc 

deepest 

po^cfg 

po_stat 

po_errJrit 

po_errJnL_en 

po_dmr 

po_subsr 


control 1 
control 1 
control 1 
status 
status 
control 1 

Pixel Organ 
Config2 
Status 
Interrupt 
Confix 
Confjg2 
ConTtg2 


"valid" bit status for lines 32 to 63 
"valid" bit status for Jbes 64 to 95 
"valid" bit stern* for lines 96 to 127 
Operand Organiser B request address 
Operand Organiser C request address 
DCC test register 

iser Registers 
Configuration Register 
Status Register 

Error/Interrupt Statos Register 
Error/Interrupt Enable Register 
Data Manipulation Rejister 
Substitution Value Register 
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0x67 
0x63, 
0x69 
0X6A 
0x6B 
0x6C 
0x6D 
Ox6E 
0x6F 

0x70 

0x71 . 

0x72 

0x73 

0x74 

0x75 


po_cdp 
pojen 
po_said 
po_idr 

po_muv_valid 
po_muv 

Op 

linK /» Pit 

OQO_Clg • 

oob_stat 

oob_errJm 

oob_err_int_en 

oob_dmr 

oob_subst 


Status 
CoiELro! I 
Control 1 
Control 
Control2 
Configi 

erand Organ 

Config2 

Status 

Interrupt 

Conflg2 

Config2 

Cunfig2 


Current Data Pointer 
Length Register 

the coprocesson Address or Immediate Data 
Image Dimensions Register 
MUY valid bits 

Base address of MIA' RAM 1 

iser B Registers 
Configuration Register 
Status Register 
Erroi/Inierrupt Register 
Error/Interrupt Enable Register 
Data Manipulation Register 
Substitution Value Register 
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0x76 
0x77 
0x78 
0x79 


oob_cdp 
oob_len 
oob_said 
□ob_iile 


Status 
Control 1 
Control t 
Control 1 


Current Data Pointer 

Input Length Register 

Operand the coprocesson Address 




Ox7A 






filing length/offset Register 
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Table 1 .1 ttie coprocessor registers 



■.numbers 
0x7B 






"lili|ir-:M ^description* T 


■■zr : 


0a7C 








1 


0x7D 











0x7E 










0x7F 








— 




Operand Organiser C Registers 


pace 46 


0x80 




CottfIg2 


Configuration Register 




0x81 


ooc_scai 


Status 


Status Register 





0x82 


ooc_err_int 


Interrupt 


Error/Interrupt Rcgisier 




0x83 


ooc_eirjnt_en 


Config2 


Error/Interrupt Enable Register 




0x84 


ooc_dmr 


Confjg2 


Daw Manipulation Register 




0x85 


ooc_subst 


Config2 


Substitution Value Register 




0x86 


ooc_cdp 


Status 


Current Data Pointer 




0x87 


oocjen 


Control 1 


Input Length Register 




0x88 


ooc_said 


Control! 


Operand the coprocesson Address 




0x$0 


ooc_tile 


Control 1 


Tiling length/offset Register 




0x8A 










OxSB 











0x8 C 










OxSD 











Ox8E 










Ox8F 












J1*£G Coder Registers 
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0x90 




Conflg2 


configure ion 




0x91 


jc_stat 


Status 


status 




0x92 


jc_err_int 


Interrupt 


error and interrupt status register 




0x93 | 


jc_err_im_en 


Config2 


error and interrupt enable register 




0x94 


jc_rsi 


Configl 


rethe coprocesson inierva] 




0x95 


jc_decode 


Control 2 


decode of curre nt instruction 




0x96 


JC_rcs 


Control 1 


residual value 




0x97 


jc_table_sel 


Control 


table selection from decoded instruction 




0x98 










0x99 










Ox9A 










OxQB 










0x9C 










Ox9D 










0x9E 










0x9F 












Main Data Path Registers 
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OxAO | 


mdp_cfg 


Conflg2 


conflguraii on 
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Table 1 .1 the coprocessor registers 











; number 
Ox A I 
OxA2 
0xA3 


; name _ 
™dp__stat 
™dp_err_jnt 
mdp_exT_irK_en 


Status 

Interrupt 

Config2 


: t r .T";.--' r,r '"- ? v f v .~~ " 

— - >' description v 
status 

error/imerrupi 


. 


0xA4 
0XA5 
0XA6 


rndp_test 
nidp_opl 
mdp_op2 


Control 2 
Control 2 


error/interrupt enable 
test modes 
current operation l 
current operation 2 




Ox A 7 


ra dp_por 


Control I 


offset for plus operator 




Ox A 8 


nidp_bi 



Control 1 


blend the coprocessor, offset to index table 
entry 




Ox A 9 


mdp_^brn 


Controll 


blend end or number nfmwc ann ^idrtmr :~ 

v w * "WHtti ui hjws ditu columns in 

matrix, binary places, and number of levels in 
halftoning 




OxAA 


mdpjen 


Control! 


Length of blejnd to produce 




OxAB 










OxAC 




















OxAE 










OxAF 














Result Organiser Registers 
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OxBO 
OxBl ' 


to^cfg 
ro^stat 


Confi£2 
Scams 


Configuration Register 




OxB2 


ro_err_im 


Interrupt 


Status Register 
Error/Interrupt Register 




OXB3 
CXB4 


ro_err_Jnt_en 
ro_dmr 


Conftg2 
Config2 


Error/Interrupt Enable Register 
Data Manipulation Register 




OxB5 
OxB6 


ro_subst 
ro_cdp 


Coring 1 
Status 


Substiiution Value Register 
Current Data Pointer 




0xB7 


rojeii 


Status 


Output Length Register 




OxB8 


ro_sa 


Configl 


the coprocessor! Address 




OxB9 


rojdr 


Configl 


Image Dimensions Register 




OxBA 










OxBB 


ro^vbasc 


Configl 


the coprocessor Virtual Base Address 




OxBC 


ro_cut 


Configl 


Output Cut Register 




OxBD 


rojrat 


Configl 


Output Length Limit 




OxBE 










OxBF 












PCIl 


bus Configuration Space alas 




OxCO- 
OxCD, 
OxCF 






A read only copy of PCi configuration space 
registers 0x0 to OxD and 0xJ\ 
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OxCE 


pci_extemal_cfg 


Status 


32-bit field dowidoaded at reset from an 
external serial ROM. Has no influence on the 
coprocessor's operation.* 
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in 

Table 1 .1 the coprocessor registers 



.-•<- 

'"•' : n ti rn hp tit. 






)l|H*?vv< -a&eriDtion \<"; ^iTZZ A /^Vnnr' l 




In 


put Interface Switch Registers 


page 23 


Ox DO 




Cpnfig2 


Conriguration Register 




OxDl 


11 J Mill 


Status 


Status Register 




OxD2 


■ 1 C An* t t\» 

i ij LiiT^i m 


Ijitci iupi 


Interrupi/Error Starus Reeisier 




0aD3 


iis_ecrjni_en 


^Ouilg/ 






Interrupt/Error Enable Register 




0xD4 


iis_ic_addr 


Status 


Input address from IC 




OxD5 


iis_dcc_addr 


Status 


Input address from DCC 




OxD6 


iis„po_addr 


Status 


Input address from PO 




OxD7 


iis_burst 


Status 


Burst Length from PO. DCC & IC 




OxD8 


iis_base_addr 


Configi 


Base address of the coprocessor memory 
object in host memory map 




Ox DO 


ii shiest 


Configi 


Test mode register 














OxDB 










OxDC 










OxDD 










OxDE 










OxDF ( 
















OxEO- 
OxFF 






not used 





1.1.3 Register Definitions 



Miscellaneous Module Registers 
a. mm_cfg 

Table 1.2 mrn_cfg Register Fields 









- i 


E 


0=disabled — 
l=enabJed 


2 


8 


0=na bypassed 
1 -bypassed 
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t>. mm_stat 
Table 1.3 mm_stat Register Fields 









: V v v> Field , \ • 


Name 


$Mtr^ ~ ; v - ^ Description * 






Reserved 





c. mm_errjnt 
Table 1.4 mrn^errjnt Register Fields 

















Reserved "'" '■?•••■"»■*-, 



d. mm_err_int__en 
Table 1.5 mm_errjnt_en Register Fields 























Reserved "" " - 



e. mm_gcfg 



Table 1.6 mm_gcfg Register Fields 









0 


use__pci_clk 


(read only; value of the use_pci_clk signal ~~ 
0=use clJc_in to generate elk 
l=use pci_clk to generate elk 


I 


sIow_dk 


(read only) divide source clock by 2 to eenerate clkr 
0=divide by 2 (default) 
l=do not divide bv*2 


2 


cbus_confjg 


(read only) 

0 = generic bus configured 

1 = external cbus configured 


31:1 




Reserved 
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Table 1.7 mm_diag Register Fields 



Field;,,. 


J Name : /; 


— — , - : Description — 


1:0 


diag__icfg[l:0] 


program which diagnostics appear on the diag_i pin: "~ 
00= the coprocessor busy 
01=new A stream instruction strobe 
10-new B stream instruction strobe 
1 1 =new either stream instruction strobe 


3:2 


diag_dcfg[l:0] 


program which diagnostics appear on the diag_d pins: 

OO^aciivity diagnostics: 

d ia g_d [ o J = PCI bus master transaction 

diag_d ( 1 ] = generic bus transaction 

diag_d(2J - locaJ memory transaction 

diag_d ( 3 ] = peripheraJ interface transaction 

diag_d[4J « non overlapped instruction active 

diag_d { 5 J « overiapped instruction active 

0 1 instruction diagnostics: 

diag_d [3:0] = current instruction opcode 

diag_d [ 4 ] « current instruction stream 

diag_d[5J = unused 

JO«caching diagnostics: 

dxag_d[0J «= data cache hit 

diag_d(l) = servicing data cache miss 

diag_d[2J « TLB hit 

diag_d[3] = servicing TLB miss 

diag_d[4] -MUVhit 

diag_d[5] = servicing MUV miss 

11 -stall diagnostics: 

d iag_d ( 0 ] = waiting on output from PO 

ag_d [ i ] = waiting on output from OOB 
<Hag_d[ 2) = waiting on output from OOC 
diag_d[3] = stall due to RO fifo full 
diag_d [ 4 ] = waiting on instrucdon fetch 
diag_d[5J = unused 
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rt5 

g. mmjgrst 
Table 1.8 mn\_grst Register Fields 



1 1 1 Field I 1 






15:0 


msrst(15:0) 


Per-moduJe sofrwart resets. 






Write 1 to appropriate bit to cause software reset. These 






bits are self clearing. 






bit 0 = External lr.:erface Controller 






bit 1 = LocaJ Memory Controller 






bit 2 = Peripheral Interface Controller 






bit 3 ■» Miscellaneous Module 






bit 4 - Instruction Controller 






bit 5 = Data Cache Controller 






bit 6 *= Pixel Organiser 






bit 7 «= Operand Organiser B 






bit 8 = Operand Organiser C 






bit 9= JPEG Code: 






bit 10 = Main Dat2 Path 






bit 1 1 « Result Organiser 






bit 12 = reserved 






bit 13 = Input Interface Switch 






bit 14 = reserved 






bit 15 = Software Reset alt modules 
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114 

h. mm gerr 



Tablft 1.9 mm^err Register Fields 







— : : ^cnpt,on. , , 


31:16 


crr{i5:0] 


Pcr-rnodu!e error status bits: 

Note that these bits are read only. To clear the error condition 
the appropriate module error bit should be cleared. 

0 = no error condition from this module 

1 = error condition from this module 
bit 0 = External Interface Controller 
bit 1 « Local Memory Controller 

bit 2 « Peripheral Interface Controller 

bit 3 = Miscellaneous Module 

bit 4 = Instruction Coniroller 

bit 5 - Data Cache Controller 

bit 6 = Pixel Organiser 

bit 7 = Operand Organiser B 

bit 8 « Operand Organiser C 

bit 9 = JPEG Coder 

bit 10= Main Data Path 

bit 11 = Result Organiser 

bit 12 - reserved 

bit 13 « Input Interface Switch 

bit 14 = reserved 

bit 1 5 «= error on any module 


15:10 


err_enU-*>:OJ 


Per- module error enables: 

0 * error conditions from this module are nor translated into an 
imemiptofthehost 

1 - error conditions from this module are translated into an 
interrupt of the host 

bit 0 = External Interface Contmller 

bit l « Local Memory Controller 

bit 2 = Peripheral Interface Controller 

bit 3 - Miscellaneous Module 

bit 4 = Instruction Controller 

bit 5 = Data Cache Controller 

bit 6 Pixel Organiser 

bit 7 = Operand Organiser B 

bit 8 = Operand Organiser C 

bit 9 = JPEG Coder 

bit 10 = Main Data Path 

bit 11= Result Organiser 

bit 12 = reserved 

bit 13 = Input Interface Switch 

bit 1 4 m reserved 

bit 15 = interrupt on any module 
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m 

f. rnrn_gexp 
Table 1.10 mm_gexp Register Fields 



i: Field 


< 1 Name- - : 


/ ^ ( r ~ Description - - * 4 - 


31:16 


exp[l5:0] 


Per-module exception status bits: 

Note thai these bits are read only. To clear the exception condi- 
tion, the appropriate module exception bit should be cleared. 

0 = no exception condition from this module 

1 = exception contii: ion from this module 
bil 0 « External Interface Controller 

bit I = Local Memory Controller 

bit 2 = Peripheral Interface Controller 

bil 3 = Miscellaneous Module 

bit 4 ■» Instruction Controller 

bit 5 = Data Cache Controller 

bit 6 = Pixel Organiser 

bit 7 - Operand Organiser B 

bit 8 = Operand Organiser C 

bit 9 « JPEG Coder 

bit 10 = Main Data Path 

bit 1 1 = Result Ortiniser : 

bit 12 - reserved 

bit 1 3 = Input Interface Switch , 

bit J 4 = reserved 

bit 1 5 *> exception on any module 


15:0 


exp_en[l5:0} 


Per-module exception enables: 

0 « exception conciiions from ihis module are not translated 
into an interrupt of the host 

1 = exception conditions from this module are translated into 
an interrupt of the host 

bit 0 * External Interface Controller 

bit 1 Local Memory Controller 

bil 2 «= Peripheral taerf ace Controller 

bit 3 •= Miscellaneous Module 

bil 4 =t Instrucdon Controller 

bit 5 = Data Cache Controller 

bit 6 e Pixel Organiser 

bit 7 = Operand Organiser B 

bit 8 - Operand Organiser C 

bit 9 = JPEG Coder 

bit 10 = Main Data Path 

bit 11 - Result Organiser 

bil 12 = reserved 

bit 13 = Input Interface Switch 

bit 14 - reserved 

bit 15 « exception cn Qny module 
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j. inm_gint 



Table 1.11 



mm_gint Register Fields 



31:16 



15:0 



intH 5:0] 



int_cn[15:0] 



Per-module interrupt s-atus dies 
Noic chai these biis are read only. To clear the uueirupt condi 
uon, the appropnate module intenupt bit should be cleared. 

0 = no in-emipt condition from this module 

1 = interrupt condition from this module 
bit 0 = External Interface Controller 
•bit 1 « Local Memory Controller 

bit 2 - Peripheral Interface Controller 

bit 3 = Miscellaneous Module 

bit A «= Instruction Controller 

bit 5 = Data Cache Controller 

bit 6 a Pixel Organiser 

bit 7 * Operand Organiser B 

bit 8 c= Operand Organiser C 

bil 9 ~ JPEG Coder 

Ml 10 -Main Data Path 

bit 11 -Restilt Organiser 

bit 12 -reserved 

bit 13 » Input Interface Switch 

bit 14 = reserved 

hit 1 5 = imerrupt on any module 



Per-raodule interrupt enables: 

0 = interrupt conditions from this module are not translated 
into an interrupt of the host 

1 = interrupt conditions from this module are translated into an 
imerrupt of the host 

bttO = External Interface Controller 

bit 1 = Local Memory Com roller 

bit 2 & Peripheral Interface Controller 

bit 3 » Miscellaneous Module 

bit 4 = Instruction Controller 

bit 5 - Data Cache Controller 

bit6 = n*el Organiser 

bit 7 = Operand Organiser B 

bi 1 8 - Operand Organiser C 

bil 9 - JPEG Coder 

bit 10 -Main Data Path 

bit 1 1 = Result Organiser 

bit 12 « reserved 

bit 13 = Inpui interface Switch 

bit 14 = reserved 

bit 15 = intenupt on any module 
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k. mm_active 
Table 1 .12 rnm_active Register Fields 









J _>.U 


mactive[1.5:0] 


; : . uescnpuon r 

Per-module active signals (Read Only): 






These bi 15 unlatched 






bit 0 = External Interface Controller 






bin = Local Memory Controller 






bit 2 = Peripheral Interface Controller 






bit 3 « Miscellaneous Module 






oil 4 «= Instruction Controller 






bit 5 = Data Cache Controller 






bit 6 « Pixel Organiser 






bit 7 = Operand Orgaiuser B 






bit 8 « Operand Organiser C 






bit 9 -JPEG Coder 






tat 10 « Main Data Path 






bit 1 1 « Result Organiser 






bit 12 = reserved 






tat 13 = Input Interface Switch 






bit 14= reserved 






bit 15 e any module active 


31:16 


/ 


Reserved 



Instruction Controller Registers 



I. ic_cfg 

The icjrfg register is divided into three pans. The least significant byte contains global con- 
figuration information. The third least significant byte contains A stream configuration infor- 
mation and the most significant byte contains the B stream configuration information. 

This register has a reset value of 0x00000000. 
Table 1I13 ic_cfg Register Fields 



; Field < 


■ * - Name ■ 


^ ; r^escnpiton v V; ;/:f:; ' 


1 


E 


l=enabled 
0=disabled 




2 


B 


0=not bypassed 
l=by passed 




3 


S 


0=not in self test mode 
l=self test mode 




[5:4} ; 


pri[1:0l 


instruction stream priority: 
0=»A high, B low 
l=B high, A low 
2^3= round robin 
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2.00 J> :j 

Table 1.13 ic_cfg Register Fields 







••Bi?!^ - : Description :"• J-;; 


6 


asr 


Asynchronous stall reject: 
0=not enabled 
! -enabled 


7 


pd 


Partial Decode enabie: 
0=not enabled 
1 ^enabled 


8 


cbus_eic_dis 


Disable requests from the EIC for Cbus cycles 
0«do not disable 
l=di sable 


9 


cbus_mm_dis 


Disable requests from the MM for Cbus cycles 
0=do not disable 
sable 


10 


cbus_jnt_dis 


Disable requests from the IC for Cbus cycles 
0=do not disable 
sable 


16 


a_en 


0=A stream disabled 
1 =A stream enabled 


17 


a_seq 


0-A stream sequence numbers disabled 
1«A Stream sequence numbers enabled 


18 


a_prefetcfi 


0=*A stream nr^frrrhinir /Keahlx»/1 
1 =A stream prefetching enabled 


19 


a_pverlap_en 


0=A scream overlapping disabled 
J =A stream overlapping enabled 




a_snc_pause 


0 A stream do not pause on "sequence number com- 
pleted" interrupt 

1 «= A stream pause on "sequence number completed" 
interrupt 


21 


a *c pauso 


0 - A stream do not pause on "instruction completed" 
interrupt 

1 = A Stream nZUV* f>n "incrnirrinn /•Amnl^A/)" - 

> i ^ ou^cji u. p&Lit uii iiu>iitiviiun compieteo inter- 
rupt 


22 


a_auto_prime 


0- A stream "secucncc number comnlr.rrd" imf-mirr 
not automatically primed 

1- A scream "sequence number completed" interrupt 
automatically primed 


24 


b_en 


0*=B stream disabled 

1 ■ — R ClrfOm jr^inlttrl 

1 — D iUulJll ciiauicu 


25 


b_seq 


0-B stream sequence numbers disabled 
l=B stream sequence numbers enabled 


26 


b_ prefetch 


0=B stream prefetching disabled 
1=B stream prefetching enabled 


27 


b_overlap_en 


0=B stream overlapping disabled 
l=B stream overlapping enabled 
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Table 1.13 ic.cfg Register Fields 







Kiisi»iiiiiiBii»; 


28 


b snc oause 


0 = a stream do noi pause on "sequence number com- 
pleted" interrupt 

1 -> B stream pause on "sequence number completed" 
interrupt 


29 


b ic oause 


0 ~B stream do not pause on "instruction completed" 
interrupt 

1 « B stream pause on "instruction completed" inter- 
rupt 


30 


b_auto_prim8 


0=B stream "sequence number completed" interrupt 
not automatically primed 

1=B stream "sequence number completed" interrupt 
automatically primed 


m. lejstat 

This renter is split into four sections. The least signi ficant byte contains ihe internal state of 
the IC. The second least significant byte conrairo the decoded result of the current instruc- 
tioa and current and prefetched instruction stream. The second most significant byte con- 
tains alt status information about the A stream, and the most significant byte contains 
information about die B stream. 

The reset value of this register in 0x00000000. 
ic_stat Register Fields 








[3:0| 


ec_state[3:0] 


instruaion execution state condition. 
O-idlc 

1 = fetching ijistrucliun 

2= waiting for overlapped instruction to finish 
3= decoding 

4^= prefetching instruction 
5=» wait for instruction to finish 
6=» synchronise with external accesses prior to updat- 
ing registers 
7= update state registers 
8= had state 


8 


overlap 


0 = overlapped instruction not in progress 
1 = overlapped instmcuoiuu progress 


9 . 


■cjnlerrupt 


0 - will not i ntemipt when completed 1 

1 = will interrupt when completed 


10 


jump 


0 - current instruction is not a jump instmction 

1 = current instruction is a jump instruction 


11 


prefetched 


0 = the next i nstrucuon i s not prefetched imo the 
prefetch buffer 

1 =the next instruction is prefetched into the prefetch 
buffer 
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Table 1 .14 ic_stat Register Fields 



1 Field 1 % 




— 1 ~ * - - 1 Description ; ; » * ; ' 


12 


slream 


currently executing ( non-overlapped instruction) 
stream: 

0 = stream A 

1 = stream B 


13 


preLstream 


stream of instruction prefetched: 

0 = stir am A 

1 - stream B 


14 


condition 


0 = jump condition not met 

1 =jump met 


16 


a_wait 


0 ~ A scream is running normally 

1 - A stream paused because semaphore is taken 


17 


a_as/nc 


0 b No asynchronous transfer in progress in stream A 
X = Asynchronous transfer in progress in stream A 


18 


a_busy 


0 = A stream not busy 

1 - A stream busy 


19 


a_sem 


A stream register semaphore status* 

0 = semaphore not claimed by anyone or claimed by 
hardware 

1 - semaphore held externally 

The a^ent recievinp [hp cpmanhArf* u/ilt caa o u r\ n in 
this bit 


20 


ajock 


0 - A stream not locked in 

1 = A stream locked in 


21 


a_primed 


0 = A stream "sequence number completed" interrupt 
not primed 

1 = A stream "sequence number completed" interrupt 
primed 


22 


a_paused 


A stream pause 

0 = A stream is not paused due 10 interrupt or error 

1 - A stream is paused due to interrupt or error 
rethe coprocesson execution by writing 1 


23 

• 


a_ol_primed 


0 A stream "overlapped instruction sequence number 
completed" interrupt not primed 

1 = A stream "overlapped instruction sequence number 

COmDletecT* intfminf nrimrri 


24 


bewail 


0 = B stream is running normally 

1 - B stream paused because semaphore is taken 


25 


b_async 


0 » No asynchronous transfer in progress in stream B 

1 = Asynchronous transfer in progress in stream B 


26 


b_bu$y 


0 = B stream not busy 

1 = B stream Uusy 
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Table 1.14 ic_stat Register Fields 





■ • :<■:!>■ <:* Stir*! 


i m $m$® - i j description -„ 


27 


b_sem 


B stream register semaphore status: 

w — scniapnorc not cjajmca uy an) one or neiu Dy die 

coprocessor 

1 - semaphore held externally 

The agent rccieving the semaphore will see a "0" in 

this hit 


28 


bjock 


0 » B stream not locked in 
J = B stream locked in 


29 


b_primed 


0 = B stream "sequence number completed*' interrupt 
not nrimed 

1 « B stream "sequence number completed" interrupt 
primed 


30 


b_paused 


B stream pause 

0 « B stream is not paused due to interrupt or error 

1 = B stream is paused due to interrupt or error 
rethecoprocessort execution by writing 1 


31 


b_ol_primed 


0= B stream "overlapped instruction sequence number 
completed" interrupt not primed 
1 - B stream "overlapped instruction sequence number 
completed 1 * interrupt primed 



n. ic_errjnt 

This register contains active high flags indicating whether an ciror or interrupt has occurred 
within the 1C, Each bit can be cleared by writing a T. 

The reset value of this register is 0x00000000. 
Table 1.15 lc_err_lnt Register Fields 









8 


ajlljerr 


A stream illegal instruction error 


16 


bjlljerr. 


B stream il legal i nstrucri on error 


24 


a_sncjnt 


A stream "sequence number completed" interrupt 


25 


ajcjnt 


A stream "instruction completed" interrupt 


26 


a_sns_jnt 


A stream "sequence number the coprocessorted" inter- 
rupt 


27 


ajsjnt 


A stream "instruction the coprocessorted" interrupt 


28 


b_sncjnt 


B stream "sequence number completed" interrupt 


29 


bjcjnt 


B stream "instruction completed" interrupt i 


30 


b_snsjnt 


B stream ** sequence number the coprocessorted" inter- 
rupt 


31 


bjsjnt 


B stream " instruction the coprocessorted" interrupt 
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o. <c_erMnt_en 

This register contains ite masks used to enable ±t various errors or intermpts and has n 
reset value of 0x00000000. 

Table 1.16 ic_errjnt_en Register Fields 



l; Field * 




D^sc notion ' \ 


8 


a_ili_err_en 


A stream illesal mstmcti on error rnahir* 


16 


b_JH_err_en 


B stream illegal instruction error enable 


24 


3 ^ni** int on 
d o» *^_JI 1 » 1 


A stream "sequence number completed" interrupt enable 


25 


a_jcjnt_en 


A stream "instruction completed" interrupt enable 


26 


O enc int 
d Jo>__H 1 1 


A stream "sequence number the coproces sorted" inter- 
rupt enable 


27 


ajsjnt 


A stream "instruction the coprocessorted" interrupt ena- 
ble 


28 


b_sncjnt 


B stream "sequence number completed" interrupt enable 


29 


bjcjnt 


B stream "instruction completed" interrupt enable 


30 


bjsnsjnt 


B stream " sequence number the coprocessortcd" inter- 
rupt enable 


31 


b_is_int 


B stream ** instruction the coprocessorted" interrupt ena- 
ble 



p. icjpa 

This register holds the most significant 30 bits of the next vinuaJ address to be used to fetch 
an instruction for the A stream. The least two significant bits are assumed to be zero as 
instructions must be word aligned/ 

The reset value of this register is 0x00000000 
Table 1.17 icjpa Register Fields 







— : ■ ^ > Description ~ ; : ^ ; 


[31 ;0] 


ipa{3l:2J 


A stream instruction pointer 



Table 1.18 



q. icjda 

This register holds the "to do" value for the A stream. This is a 32 bit (wrapping) sequence 
number up to which valid instructions exist. 

The reset val ue of this register is 0x00000000 
icjda Register Fields 



< " -Field; % ; 


• .;. ;K!ame \. 


;•: description^ , ,r 


(31:0) 


tda[31:0] 


A stream "to do" value 



r. icjna 

This register holds the "finished" value for the A stream. This is a 32 bit (wrapping 
sequence number indicating the last completed instruction. 
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The icset vaJuc of I his register is 0x00000000. 
Table 1 .1 9 icjna Register Fields 



Field , 


■■■■■■■■■ ■..;■.:■■..•■•->:•:■:•:■■■ 


f s- 1 Description 




|31:0] 


fna[31;0] 


A stream "finished" value 





s. ic_inta 

This register holds .he "imemtpr sequence number for the A stream. This is a T> bit (wan 
P»ng) sequence number a which to "interrupt at" if the mechanism is enabled and primed 
The reset value of this register is 0x00000000 
Table 1 .20 icjnta Register Fields 











131:0] 


inta{31:0J 


A stream "interrupt" value 



t. icjoa 

This register holds the 32 bit (wrapping sequence number of the last overlapped 
to be executed on the A stream. 

The reset value of this register os 0x00000000 
Table 1.21 icjoa Register fields 



instruction 



mmmm 






[3i:0] . 


loa[31:0) 


A stream "last overlapped instruction sequence" 
value 



u * c Jpb 

This register holds the most significant 30 bits of the next virtual address to be used to fetch 
an instruction for die B stream. The least two significant bits are assumed to be zero as 
mstrucuons must be word aligned. ^ 

The reset value of this register is 0x00000000 
Table 1.22 icjpb Register Fields 









[31:0] 


ipb[31:2] 


B stream instnacrion poinier 



v. icjdb 



This register holds the "to do" value for the B stream. This is a 32 bit (wrapping) sequence 
number up to which valid instructions exist ^*a*u"«.c 

The reset value of this register is 0x00000000 

Table 1.23 icjdb Register Fields 



. Fields ,' 




- - - Description - j ;s - •; 


|31.0| 


tdb[31;0] 


B stream lo do" value 
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w - ic__fnb 

This register holds the "finished" value for tl* r tk;c o no w , 

sequence number indicating the last completed indirection ' 
The reset vaJue of this register is 0*00000000. 
Table 1.24 icjnb Register Fields 



Field - ' 

[31:0] 


fnb[31:0) 


. r , •: : '.Description. 
B sirearn "finished'* value 













x. icjntb 

This register holds the "interrupt" sequence number for the B stream This is a 32 hir 
P«ng) sequence number at which to "intemip, at" if the mechanism is c^bl^d 
The reset value of this register is OxOOOOOOOO 
Table 1 .25 icjntb Register Fields 









[31:0) 


inU3[31:0] 


B scream "interrupt" value 



y. icjob 

The reset value of this register is 0x00000000 
Table 1.26 icjob Register fields 



instruction 



ii Fiiifii 


;m Name.- 


- " . i . — . — | 

;;.uru^ ;v . - Description ■ • "/ .: ^ ' ? , 


[31:0] 


lob[31:OJ 


b stream last overlapped instruction sequence" value 



z. Ic sema 



b?i££5£ r a* f ° r * C ' C - Stat reSi " er With the side *« » of this register 
ts a request for the A stream register semaphore. register 



aa. ic_semb 



This register is an alias far the ic stat register with the side effect ih„ a m * 

is a request for the B stream register semaphore. ' ' ° f ^ reg ' Ster 
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Input Interface Switch Registers 
ab. iis_cfg 

Table 1 .27 iis_cfg Register Fields 



> FieldfbftsV'i 


L^I^ ie l_'../ i - ' i definition * , 


[31:14] 




Reserved 


(13:12] 


PO-P 


pnonty of PO (0 = lowest, 2 - highest) 


(11:10] 


dcc_j> 


pnonty of DCC (0 « lowest, 2 = highest) 


[9:8] 


ic_p 


pnonty of IC (0 = lowest, 2 « highest) 


(7:4] 




Reserved 


(3! 


S 


0 = not in self test mode 






1 = self test mode 


(2] 


B 


0 n not bypassed 






1 = bypassed 


(I] 


E 


0 = disabled 






1 = enabled 



ac. iis_stat 



Table 1 .28 iis_stat Register Fields 









[31:25] 




Reserved 


(24] 


i_prefeecii_po 


state of i Drefetch_po signal 




Lprefetch^dcc 


state of i_prerctch_dcc signaJ 


[22] 


i_prefetch_ic 


state of i_prefetch__ic signal 


[21:20] 


Imc.pojpri 


rotating priority of POarbitradng for die LMC 


[19:18] 


Irnc_dccjpri 


rotating priority of DCC arbitrating for the LMC 


[17:16] 


lmcjcjpri 


rotating priority of IC arbitrating for the LMC 


(15:M) 


eic_po_lpri 


rotating priority of POarbitradng for the EIC 


[13:121 


eic__dccjpri 


rotating priority of DCC arbitrating for the BC 


[11:10] 


eicjejpri 


rotao ng priority of IC arbitrating for the EIC 


9 


lmc_gnt_po 


1 = PO is granted control of LMC 


5 " 


lmc_gnr_dcc 


1 ° DCC is granted control of LMC 


7 


lmc_^nt_ic 


1 «* IC is granted control of LMC 


6 


e>cjjnt_po 


1 = PO is granted control of EIC 


5 


eic_gnt_dcc 


I = DCC is granted control of EIC 


4 


ei <L_gntJc 


1 «= IC is granted control of EIC 


3 


vaIid_req_po 


1 = legal request made to IIS by PO 


2 


valid_req_dcc 


t = legal request made to IIS by DCC 
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Table 1.20 iis_stat Register Fields 



• I 


vaJid_reqjc 


Definition - ; * :v.<' : \. 


0 


Lsource 


1 - IcgaJ request made to IIS by IC 
State of i.source signal 



ad. iis_err_ini 



Table 1 .29 iis_err_Jnt Register Fields 



[31:16] 


interrupt 




[15:0] 


error 


intemipt condition " 
error conation: 

bit 3 = iBtts request made to IIS wiiiie disabled 

bit 2 - read request from illegal the coprocessor 
location 

from Pixe! Organiser 

bit L = reaj request from illegal the coprocessor 
location 

from Data Cache Controller 

bit 0 =* reed request from illegal the coprocessor 

location from Instruction Controller 



ae, iisjerrjntjcn 



1.30 iis_err_int_en Register Fields 



[31:16) 


int_jnask 


interrupt mask 


[15:0] 


err_mask 


error mask: 

bit 3 = IBus request made to lis while disabled 

bit 2 = read request from illegal the coprocessor 
location 

from Pixel Organiser 

bit 1 -itsd request from illegal the coprocessor 
location 

from Data Cache Controller 

bit o = ie2 d request from illegal the coprocessor 
location 

from Instruction Controller 
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at. iisjc_addr 
Table 1.31 iisjc_addr Register Fields 





- - Kame 


>'X\;& Definition.; ' :-/;';}. 


[31:0] 


ic_addr 


Address for data currently requested by IC 



ag. iis_dcc_addr 



Table 1 .32 iis_dcc_addr Register Fields 







Addrc^for ^ DCC^ 


(31:0} 


dcc_addr 





ah, iis_po_addr 



Table 1.33 iis_po_addr Register Fields 













[31:0] 


po_addr 


Address for data currently requested by PO 





at, iis_burst 



Table 1.34 iis_burst Register Fields 



Re1d(i>ifsi^. 






[13:0] 




Reserved 


[21:16] 


po_burst 


Burst length from PO 


[13:8] 


dcc_burst 


Burst length from DCC 


[5:0] 


ic_burst 


Burst length from IC 



aj, irsj)ase_addr 



Table 1 .35 iis_base_addr Register Fields 



i^e|l(bfts^i- 


— :: Name 




[3i:l2J 


the 

coprocessor__ba5C 
|I9:0J 


Base address of the coprocessor memory object 
in memory map. the coprocessor memory object 
is aligned on a page boundary. 


Ci i =oi 




Reserved 
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ak. iis_test 
Table 1.36 iis_test Register Relds 







III Defmitioa,VVv ||| | 


[31:4] 




Reserved 


(3:0) 


test_mode[3:0] 


0 - IC -> E1C, PO -> LMC 
0-IC-> E)C t DCC->LMC 
O-PO-^EIC, IC -> LMC 
0=PO-> EIC.DCC -> LMC 
0-DCC-> E1C, IC -> LMC 
0=DCC->EIC.PO->LMC 



External Interface Controller Registers 



a I. eic_cfg 
Table 1.37 eic_cfg Register Fields 





-Name .' £ 




0 


reserved 




1 


enable 


Enables all EIC operation as a master on PCI and 
Generic buses. 


2 


bypass 




3 


reserved 


Enables the operation of one or more of the EIC 
bypass modes. 


5:4 


pci_arb 


PCI Bus Arbitration: 

00 = Fair arbitration 

01 - IBus has priority 

1 0 = RB us has priority 

11 - Fair arbitration 


6 


pci J bus_minj cn 




7 


pci_rbus_minjen 




8 


pci_mri_mrrn_en 


Enables use of PCI memory read line and memory 
read multiple modes. 


10:9 


gen_arb 




11 


geruclk 


0 « Generic bus clock is 1/4 of clX 

1 - Generic bus clock is 1/2 of elk 


21:12 


mm u_ mask 


Mask bus that determine the page size used in the 
MMU. 


29:24 


hash_func 


Oetermine the hash function used for paee table 
lookups. 

— __ 
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er Fields 


mmm 


V: ■ Name. : - . 


:** : > t Definition V&tW%>* 


30 


liash. bypass 


if 1 Ihen bypass tl^c hash function completely so 
that the page able index is raken from the bottom 
13 bits of [tie virtual page number (regardless of 
page size). 


31 


mx7iu_b>'pass 


Forces the MMU to do null mapping on ail ~ 
requests. 



am, eic_stat 

Note; bics in this register can change at any time. 
Table 1 .38 eic_stat Register Fields 



Table 1.39 









0 


ibus^pcLact 


moicates an acuve request from the IBus to the " 
PCI bus. 


I 


rbus_pci_act 


Indicates an acuve request from the RBus to the 
PCI bus. 


2 • 


mrau_act 


Indicates the MMU is in tlie middle or an opera- ~ 
aon- 


3 


gen_act 


Indicates an acuve request to the Generic bus. 


4 


ibus_pci_gnt 


Ibus granted to PCI bus 


5 


rbus_pci_gnt 


Rbus granted to PCI bus 


6 


"^U-PCt^gnt 


MMU service granted to PCI bus 


31:4 


Reserved 





an. eic_err_int 
eic_err_lnt Register Fields 



0 


pagc_.fault 




1 


protjauk 


Set when a page fault error occurs. 
Set when a protection fault error occurs. 


2 


gcnjzn 


Set when the Generic bus error pin is asserted for 
at least one gen_ci.)c cyc [ c< 


3 


°en_addr_enr 


Set when a Genenc bus burst request crosses out- 
side the Generic bus memory region 


4 


rwd_err 


Request when disabled - set when the HIC 
receives an RBus or IBus requesi when its enable 
bit is zero. 


5 


targeLabon_crr 


A copy of the Received Target Abort bit in the 
PCI configuration space. 



-553- 



Table 1.39 



Ml 

eic^err Jnt Register Fields 



;\Field(bli<;)V 


Name " 


- < « . : - Definition ; . - : 


6 
7 


mastcr_abort_err 
reserved 


A CODY of tile SipnaM^ri T^r&*>r AJv-xri k:» ■ 

PCI confl prauon space. 


£> 
□ 


. data__pariry_exp 


A copy of the Detected Data Parity Error biU^T 
PCI confi guration space. 


9 

23:10 


gen_exp 
reserved 


Set when the gen_exp_i pin is asserted fbrai 
least one c Sn _clx cycle, 


24 


gen_int 


Sec when tine gen_int,l pin is asserted for ai 
least one gen_clx cycle. 


31:25 


Reserved 





ao. elc_errjnt_en 



Table 1.40 e\c_errj nt Register Fields 









0 


pagejault 


Enables the corresponding error bit to cause the 
assertion of c_err. 


1 


proc_fault 


Enables the corresponding error bit to cause the 
assertion of c_e rr . 


2 


geruerr 


Enables the corresponding error bit lo cause the 
assertion of c^err. 


3 


gcn_addr_err 


Enables the corresponding error bit to cause the 
assertion of c__err. 


A 


rwd_err 


Enables the corresponding error bit to cause the 
assertion of e_err. 


5 


wrgeLabon_err 


Enables the corresponding error bit to cause the 
assertion of c_err. 


6 


master.abort^err 


Enables the corresponding error bit to cause the 
assertion of c_err. 


7 


reserved 




8 


da ta_pariry^cxp 


Enables the couesponding error bit to cause the 
assertion of c_exp. 


9 


geruexp 


Enables the corresponding error bit to cause the 
assertion of c_exp. 


23:10 


reserved 




24 


genjnt 


Enables ihe corresponding error bit to cause the 
assertion cf c_int. 


31:25 


Reserved 
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ap- eic_test 
Table 1.41 eicjest Register Fields 



Field (61X3):'; 


. Name-,,- ; : 


Definition : \ 


3l:0 


Not yet defined 







aq. eic_pob 



Table 1.42 eic^pob Register Fields 









0 


gen_pobO 


vaiuc of tte genric bus programmable output pin 
pobO 


1 


gCn_pobl 


Value of the genric bus programmable output Din 
pobl y 



ar. eic_high_addr 



Table 1.43 eic_dual_addr Register Fields 









31:0 


high_addr 


Offset of 4GB region in host memory in which the 
coprocessor data structures exist (forduai address 
cycles). 



as. eic_wtlb_v 



Table 1.44 eic^wtlb^v Register Fields 









1:0 


PLfiags 


The flags associated with the TLB entry to be 
written. 


11:2 


Reserved 




31:12 


vpn 


Vmual Page number to be written or invalidated. 
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Table 1.45 



at. eic_wtlb_p 
eic_wilb_p Register Fields 



0 


inv_aJl 


^ - SeHnitton , > 

Causes the entine TLB to be invalidated. The EIC 
clean this bit after performing the TLB operz. ion 


1 


invjemry 


Invalidates a TLB entry with the vimiii ^ a «r. 
number specified. The EIC clears this bit after 
perforrrJ ng the TLB operari on. 


2 


wriie_lru 


Wntesa iLB entry to the LRU location with thc~ 
specified virtual page number, physical pa-c 
number and control bits. The EIC dears tfiis bit 
after performing the TLB operation. 


3 


write 


Wnte ths TLB entry specified by the Entry ~^ 
Number field with the specified virtual page 
number, physical pass number and flag bits. The 
EIC clears tins bit after performing the TLB oper- 
ation. 


7:4 


em_num 


Entry number to be written by Write Entry ope-aT 
u'on. ^ 


31:12 


ppn 


Physical Page number to be written. 


au. eic_mmu_v 

Note: values in ti* register can change at any time, umess the MMU is disabled by a pace 
faul t error or an MMU to PCI bus error. 0) p3ge 


eic_mmj_v Register Fields 








3:0 


mmu_lru 


Current LRU location 


4 


mmujiit 


mmujiit is set when that lastTXB access resulted 
in a hit without any page table access 


5 


mmu_rcrry_hii 


v_nm w_.rarv Jut is set when that last TLB access 
resulted in a hit after a page table access 


11:6 


Reserved 




3l:t2 


mmujvpn 


ine most recent virtual page number sent to the 
MMU for translation. 



av. elc_mmu_p 

Note: values in this register can change at any lime, unless the MMU is disabled by a oaee 
fault error or an MMU to PCI bus error. uiadwcu u> a page 

Table 1.47 eic_mmiJ_p Register Fields 



IIIIIS1 




? . ^P0fint1tOt» r ■ _..U;— - X 


31:0 


mraujpta 


The mcst recent page table physical address 
fetched by ihe MMU. 
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aw. eicjp_addr 



Note: va.t, es in M S renter can chan« at anv , is - unless lne ]BD js dj 
Table 1.40 eic_ip_acJdr Register Fields 





: - -Name \ ^^Pi^rifclpnl sZ^^yU- 


31:0 


libpa 


^ h> PCr B addrcSS 01 mc mmi recem 1Bus acces * «o ~ 



ax. eic_rp_addr 
Table 1.49 eic_rp_addr Register Fields 



disabled by an R Bus 









31:0 


lrfrpa 


Physical address of the most recent RBus access " 
to the PQ Bus. 



ay. eicjg_addr 

Notches in this register can at any tinx, unless the GBC is disabled by a Generic 
Table 1.50 e!c_ig_addr Register Fields 









31:0 


Hbga 


Address of the most recent IBus access to the 
Generic Bus. 



az. Pfc_rg addr 

BuTer" ^ «■«. —Che GBC is dis abIed by a Generic 

Table 1.51 eic_rg_addr Register Fields 



ilftlffli 






31:0 


Irbga 


Address of rhe most recent RBus access to the 
Generic Bus. 



PCIbus Configuration Space Alias 

SSeSSSSSS 00 Spacc which consis:s of 16 words is a,iased to - 
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Local Memory Controller Registers 



ba. Imi cfg 
Table 1 .52 imi cf g Register Fields 



t-iOld{bits) 
(31:24) 


* - Carrie A : ( 
refjnterval 


■ ^ 

lm i Desbription :M §1||§ 


[23:22] 


ro_prio 


Refresh interval m multiples of 4-clk rxriod< 
RO access prionty (3 = highest) 


{21 :20J 


pic_prio 


PIC access pnonty (3 - highest) 


119:18] 


ic_prio 


IC access pnonty (3 « highest) 


(17:16) 


iis^prio 


IIS access pnonty (3 = highest) 


(15:13) 


nearbJntervaJ 


Ke-artntration interval in words (2*n words) ] 


(12J 


mem^enable 


Memory enable (1 «= memory installed) i 


111) 


banks 


Number of DRAM banks connected (0=1 1=2) 


(10) 


1 iis_flush 


Rush the IIS Drafetrh Hiira f«/»lf -.i-;.»^n 

***** ^ti**H^*i uuui ^5cu cjearuiK * 


f9) 


iis_pnefetch 


IIS Dltfeichin* rnahlp 


(8) 


! refresh_en 


.triable anin fPER^ n-Tn-»rV«ir-.r» at nr> » » * 
^ UAU «wu ^vdhj reircsning or DRAM 


U) 


seamless 


jiiLauic bcaiTuess oUKAJvi page faulting 


16] 




reserved 


(5:4) 


bvpass_mode 


Bypass mode select: 

00 = Rous 

01 = Abiis write 
10 «■ Abus read 

U-Ibus 


(3) 


S 


Self test mode 


(2) 


B 


Bypass enable 


tl) 


E 


Enable module 


(0) 




Reserved 



IS 'S Kr C ° ntalnS imny confi * uraJion ^ C0 ™°1 bits used to define LMC operation 

The register has a reset value of 0x20000100 setting the refresh interval to 3.2 us at a clJcin 
frequency of 80 MHz. All special modes and features are disabled at power up^d aVacce* 

is disabled (E=0). Refreshing is not affecied by the E bit 
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bb. Imi_stat 
Table t .53 lmi_stat Register Fields 




AH bits within this register are readonly. 

be. Imi_err_jnt 
Table 1.54 Imi.errjnt Register Fields 
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one to a particular bit will cause that bit to he rp^» w^. « 

t^, k:. mi io re reset. U ni:ng a zero to any bit lias no effect on 



that bit. 

Table 1.55 Interrupt Bit Definitions 



24 



25 



: -~ ' ": Description : . . " ■ ' 

Refresh Intcirapc. Occurs o nce per refresh i Mm !i"^7 " 



Refresh noi serviced prior to subsequent refresh 



request 



Table 1 .56 Error Bit Definitions 



Access to DRAM when no DRAM enabled 



Access to DRAM while module disabled 



Rbus Address Error 



A bus Address Error 



Cbus Address Error 



I bus Address Error 



f bd. lmi_err_int_en Register 
Table i .57 lmLerr_int_en Register Fields 













p r---:' -Description - — * 


[3t:24] 


1 intern ask 


Interrupt mask bits 


£23:8| 


exp_mask 


Exception mask bits 


[7:0] 


err_rnask 


Error mask bits 



S™;^ Cepti T- intCITUpt Cnablc rcgister is used 10 Actively enable and disable 
error, exception and tnrcrrupt signals. The register is read/ write. This irfste ^ J 

IS* SIT of u * cir ^ excep,ion intempt -SKS n; 

re ister. There is a one-to-one correspondence between the bits in this register and the bi« in 
the lml_ e rrJntreg 1S teL If a particular bit in the Imi err int en re*iste * iset w!h 21 ,h 
corresponding bit in the ImLerrJn, register will be enabled ,Z X ^£&S£T 
«k : error, exception or intem.pt output signal, c.err. c exp or c „ £ft ™^ if i 
particular lrn L errJnt_ e n bit i , cleared to rero then the correspond! „, Sn aneTrnf £ ^ 
renter „ d-sabted and cannot cause the assertion of the o t£ £ *J l?J2j sfnS 
ST"* ^ ^ ^ thC eXp - ask bi « F " have ^effect 5 

;trxrr= t 2? dsawes - — — — ■ 
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be. Imi_dc(g 
Table 1.58 lmi_dofg Register Fields 



[31:29] 


row^btts 


Number of DRAM row address bits Cn + 61 


[28:26] 


coUunn_bits 


j^uinoeror ukam column address bus (n + 6) 


(25) 


edo_dram 


EDO tVTY* DRAM HI — I.T> t cr\n\ 


[24:22] 


prccharge_enable 


Prcchargc enable address bit (n + 5) ~ 


I21I91 


precharge^bank 


Precharfe hank address bit (n 4 5) 


F18M71 


casjo 


Number of CAs low clock cycles (CAS pulse 
width) {I to 4,0= l) 


[16] 


cas_hi 


Number of CAS high clock cycles (CAS pre- ~ 
charge) (1 to 2.0= i) 


(15:13) 


ras lo 


Minimum RAS low clock cycles (1 to 8» 0 » O 


[12:11] 


ras_hi 


Minimum RAS high clock cycles CR AS pie- 
charge) (I to4 t 0 = I) 


[10:9) 


ras_cas 


RAS to CAS delay in clock cycles (2 to 6 0 = 2^ 


[8:7] 


casjar 


SDRAM CAS latency in clock cycles: 
0, I- 1 clock 
2 = 2 clocks 
3-3 clocks 


[6:0] 




Reserved 



bf. Imi_mode Register 
Table 1.59 Imi_mode Register Fields 





Reserved " ^ 


[31:14] 






[13] 


initialize 


Initialize SDRAMs (program mode register, self 
clearing) 


112:0] 


sdrarn_modc 


SDRAM mode biu (written to SDRAM mode 
register over the address pins by initialize 
SDRAM command) 



This configuration register holds the information iha. is written into Che SDR MA mode rec 
pan Of ■« initialization procedure. This registry be read or writ, »JS?,K 
and may be wnuen 10 the S DRAM by setting the initiate bit. 

This register has a reset value of 0x0037. This is a useful default value that is required imme- 
d.ately after precharging „ power up or after a level 1 reset. This sets read laTenrl .n i 
clocks and burs, length to full page with sequential wrap '° 3 
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220 



After any reset, Ujc initialize bil win set ir Uie &d!--»rr i nm ;< \« w ^ ;«v ■> 

^ SORAM .ode reg is,e, AAcr mods ^^S^^l^Z 

cleared to icro automatically. 



Peripheral Interface Controller Registers 
bg. pic^cfg 



Table 1.60 pic_cfg Register Fields 



v. mmm 




' ' Definition - • * 


0 


reserved 




1 


enable 


HnabJcsaliPIC operation. 


2 


bypass 




3 


reserved 


— . 






Causes all byte paebn^unpacking to use big- 
endian order, i.e. bits 3 1 :24 first, bits 7:0 last 


5 


video mode 


0— Video Output 
I - Video input 


7:6 


reserved 




8 


gate_out_clock 


Used in Video Output mode. Causes the output 
clock to be sated with volk en in l 


9 


inv_out_clock 


UiCU 111 > » u eo output mode. Causes the output 
Mock to bt inverted with respect to vclk In. 


10 


usc_ricfaultjjaia 


Used in Video Ontrmr m/-^p TrfV»;«- v*:» :* . , 

t iucu wuipui moae. ii irus oil is set, the 
value in the default_d 9 ta register bit is driven 
onto the output data pins for cycles when data is 
not valid Otherwise, the nest vaJid data byte is 
driven if available. 


II 


default_data 


Used in Video Output mode. The default data 
used if use^def ault_data is enabled. 


12 


data_cn_sourte 


Used i n Video Output mode. When 1 , " 
vdata_en_put_Jl is asserted when 
vdata_en_in^l is sampled asserted. Other- 
wise, vdata_en_out_l is asserted when 
vclk_en_in_l is sampled asserted. 


13 


deLdata 


Used in Video Output mode. Causes the output 
data to be delayed one cycle with respect to 
vdata_en_out_l. 


14 


gate_\viih_rdy 


Used in Video Output mode. Causes the output 
clock to be gated with vrdy_L. 


15 


reserved 




16 


forward_aa_dis 


Disables the assertion of c_acti vc due to data 
in the output FIFO. 


17 


reverse_act_dis 


Disables the assertion of c_activ e due to data 
in the input FIFO. 


18 


forvvard_priority 


Keiative priority of forward transfers when in a 
lidirectional mode. 
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Table 1.60 



pic_ctg Register Fields 



19 



20 



21 



22 
23*" 



- , » Narne-.- - :;: 



revene_priority 



forwardjnverwlata 



revcrsejnvsrtjiata 



forward J nvert_control 




Relative prioriry of reverse transfers when in a 
bidirectional mode. 



Invert output d ata signals 
Invert input data signals 



reverse J n ven_contro[ 



Invcn output control signals 



Invert output control signals 




bh. pic_stat 



Table 1.61 ptc_stat Register Fields 









0 


abus^act 


Set wren an ABus transaction is pending or in ' 
progress. 


i 

2 
3 


rbus^art 

reverse_act 
cenL_cfg 


Set when an RBus transaction is pending orln 
progress. 

Set a when a reverse data transfer is active. 


4 

5 

6 


forvvardjull 
forward_suhword 


U>py of the external cent cf ° pin. 
Set wten the PIC's output FIFO is full. 
Set when the PIC's output FIFO has space for 
more than zero but less than four bvte<: 


7 


fcrward_empry 
reserved 


Set when the PIC's output FIFO is emotv 


$ 


reverse Jull 


Set when the PIC's input FIFO is full. 


9 


reverse_ernpty 


Set when the PIC's input FIFO is empty. 


10 


revene_subword 


Set when the PIC's input FIFO contains more 
than zero but less than four bytes. 


15:11 


reserved ■ 




16. 


pic_plh 


The state of the p i c_pih pin. 


17 


pic_ackj 


ihe state of the pie_ack_l pin. 


18 
19 


pic^busy 
pic^pcnoi 


I he state of the pic^busy pin. 


20 


pic^sclcct 


i ne state of the pic_ P error pin. 
me state of the pic_s elect tin. 


21 


picj"aultj 


The state ofthe pic_ fault loin. 




pi c_data Jn 


The state of the pic .data-in sionai 
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7U 

bi. pic__errjnt 

Error aj^imEnvp bits of the pic_ err . in! re S is.er can oniy be. set by the PIC. and can on i v 
be reset by software, liech bit is reset by writing a I to .hat bit. 5 

Table 1 .62 plc_err_in1 Register Fields 



mam 






a 
\j 




Forward transfer error. Set if the PIC receives out- 
put data to transfer when configured for an inout- 
only mode. y 


\ 


rwd_err 


Request when disabled error. Set if the PIC is chV~ 
a!)ted when it receives request for an RBus trans- 
action, or any register-iniiiated operation (ABus 
transfer, Centronics command, etc.). 


2 


uineout_err 


Peripheral timeout error. 


8 


rcvcrsc_cxp 


Kcvcise transier error. Set if software attempts to 
the coprocessort a reverse transfer when the PIC 
is in an output only mode. 


9 


cbus exp 


Set wnen a CBus cycle to the input or output data 
stream time* out When this bit is set, further 

CBuS Cycles t-O the innttf nrmimiir rl.ji.j 

/ v.wo iv u *b utjjuL urQucput finta streams 
2re treated as reserved. 


24 


icv_compjm 


H/ic-vuuni Jinut nas been reached in 
a reverse transfer. 


25 


rev_datajnt 


Set when data is flv»rtahi*» fw*™ o ^ 

"" 4I uoul ,3 <iv4iiaoie irom a reverse transfer 

and there is no active bus transaction to indicate 
its destination. 


26 


abus^compjnt 


Set when abus transfer completes 


27 


ciraerjni 


Set when timer 0 or 1 expires 


28 


comp_fauli w im 


Set tn Centronics compatible mode when the 
pic^faultj signal is asserted. 


29 


comp_perrorjm 


Set in Centronics compatible mode when the 
pic_perror signal is asserted. 


3G 


ecp_rev_re<]_iru 


Set in Cenrronjcs ECPmode when die peripheral 
indicates thai it rm reverse data available 




no_peripheral_iiu 


No peripheral interrupt 


H pic_err_int_ 
Pte_err_int_en F 


en 

Register Relds 






*:-:^;;.., ^Definition ," . ~ H 


0 


forward_crr 


Enables the corresponding i me rrupt bit to cause 
the assertion of c_int. 


1 


rwd_err 


Enables the corresponding imenupi bit to cause 
the assertion of c_iu i. 


2 


u.meout_cn- 


Enables the corresponding mierrupt bit to cause 
the assertion of c_int. 



-564- 



Table 1.63 pic_errjnt_en Register Fields 



Fieid(btfs) 


Name ! ■ 


- definitions 


8 


rcvcrse_cxp 


Enables the corresponding internipt bit to cause 

*■* £ic>oL- j u \ji i ui c in L. 


9 


cbus_exp 


Enables the corresponding interrupt bit to cause 

thp" fl^vrtinn of r-» •» r» 


24 


rev_comp_jnt 


Enables the corresponding interrupt tat to cause 

trip 3*l<^rfion t\f r* «» 


25 


rev^datajnt 


Enables the corresponding interrupt bit to cause 
the assertion of { n t- 


26 


abus_cqrnpjnc 


Enables the corresponding interrupt bit to cause 
the assertion nf { nf 


27 


timerjnt 


Enables the corresponding interrupt bit to cause 
the assertion of c in t 


28 


comp_fauk_im 


Enables the corresponding interrupt bit to cause 
the assertion of c_int. 


29 


comp_petrorjnt 


Enables the corresponding interrupt bit to cause 
the assertion of c_int. 


30 


ecp_rev_req_jnt 


Enables the corresponding interrupt bit to cause 
the assertion of c_int. 


31 


no_peripheralJnt 


Enables the corresponding interrupt bit to cause 

the assertion of c int. 
- 



bk. pic_abus_cfg 



Table 1 .64 pic_abus_cfg Register Fields 





. - iMarny 


warn mm m^mmmm% % - % 


23:0 


ab_count 


Number of bytes remaining to transfer to/from the 
LMC. Set initially by software, and decremented 
by the PIC as each byte is transferred. 


27:24 


ab_byte_en 


Byte lanes to be used for ABus transfer 


28 


ab_type 


1 « Transfer is a read (from peripheral pot) 
0 «= Transfer is a write (to peripheral port) 


29 


ab_the coproces- 
sort 


the coprocessor!.* the programmed ABus transfer. 
This bit is cleared by the PIC when the ABus 
transfer has been completed. If cleared by soft- 
ware, the ABus transfer is terminated. 


31:30 


reserved 
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bl. pic_abus_addr 
Table 1.65 pic_abus_addr Register Fields 



■ riBld(bils)' 






31:0 


ab.addr 


The byte address a which the nexi ABus transac- 
tion should te the coproces&oned. 



brn. ptc_cent_cfg 

The pic cenLcfg register contains read/write control signals and read-only status si anal 
control ail aspects o; the interface when Centronics mode is enabled. 



Table 1.66 pic_cent_cfg Register Fields 







^Cent^ 


2:0 


cen_cmd 


000 Return to Compatible 

001 - Request Nibble 

010 ~ Transfer Device ID using Nibble 

011-100 = no-op 

iui — i^equest cur 

110 = Transrer Device in using ECP 

111= Abort 


3 


ceruexe 


the coprocessors execute of programmed Centro- 
nics command. This bit is reset when the Centron- 
ics command completes. 


4 


pan CIC 


This read-only bit shows the completion status of 
the previous Centronics coirunand. A 1 indicates 
that the command was successful, a 0 that it was 
unsuccessful. 


7:5 


ccn_modc 


Read-only bits showing current Centronics mode - 
and operation based on the last negotiation: 

000 = Compatible 

001 = Compatible with Nibble enabled 

010 - Transferring Device ID using Nibble 

011 - Nibble Mode 

100 = reserved 

101 = ECP forward mode 

110 = Transrerring Device ID using ECP 
111= ECP reverse mode 


8 


cen_direct 


Bypasses the Centronics control logic and allows 
the software to direoJy control the output bi ts by 
writing and read i ng thi s regi stcr. 


9 


cen_host_rec_en 


Enables host recovery mechanism in ECP mode. 


15: 10 


reserved 




23:16 


cenjim 


Timer value, in cl k cycles, for all Centronics 
seiup. hold and pulse times. 


31:24 


ce.n_perj;m 


Timer value, in lots of 64k elk cycles, when 
waiting for a Centronics peripheral response. 
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hn. pic_ccnt_dir 
Table 1.67 pic_ceni_dir Register Fields 



1 Refd{bits) 


r ^tNarnp - - " 


. , » v Detention ■■;,;>-. 


0 


r>ir Hata 


Direct comrol bit lor the pic_data_oe signal. 


1 


pic_strohc_l 


Direct coniro! bit for the pi c stiobe 1 *i*mi 


2 


pic_autofdJ 


Direct control bit for the plc_autcfri l sisnal 


3 


picjritj 


Direct control bit forthe pic_init l signal 


4 


pic^se!eciin_l 


Direct control bit forthe pi. o_select in l sig- 
nal. ° 


5 


pic_buf_enj 


Direct control bit for the p i.c_buf _ en l signal. 


6 


picj>uf_dir 


Direct control bitfor the pic_buf dir signal 


7 


reserved 




15:8 


pic_data^out 


Direct control bit for the pic data out sisnal 


31:16 


reserved 





bo. pic_reverse_cfg 



Table 1.68 ptc_reverse_cfg Register Fields 



mmm 






23:0 


rev_count 


Number of bytes remaining to transfer from the 
external peripheral. Set initially by software, and 
decremented by the PIC as each byte is trans- 
fared. 


24 


iev_use_counc 


Causes the reverse transfer to be terminated after " 
the specified number Of bytes have been trans- 
ferred. 


25 


rev_die coproces* 
sort 


the coprocessons the programmed reverse trans- 
fer. If the rev_use__count bit is set, this bit is 
Cleared by the PIC when the reverse transfer has 
been completed. The software can clear this bit to 
stop the reverse transfer at any time. 


31:26 


reserved 





bp. pic_JijnerO 



Table 1.69 picjimerO Register Fields 



..Fieldibiisj^ 






31:0 


ttrnerO 


1 lmer value for firsi data timeout, in units Of one 
cLk. 
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2H 

bq. picjimerl 
Table 1.70 picjimerl Register Fields 





: : " Name N ' ' ^ 




3i;0 


tirnerl 


Timer value fordata-ro-daia timeout, in units of 
one elk. 



Data Cache Controller Registers 



br* dcc__cfgl 
Table 1.71 dcc_cfg1 Register Fields 



i.WAV*W 






^^^^^ 






liiiitiiiilll 


t 




E 




0=disabled 
Enabled 






2 




B 




0=not bypassed 
I ^bypassed 







bs. dcc_cfg2 



Table 1.72 dcc_cfg2 Register Fields 



Pi 6 kJ (bits) 






0 


Auloinv 


0-not m auto-invalidatc mode (value upon reset) 

l=auto-invalidaie cache at me end of each instruc- 
tion 


1 


Autofill 


0=not in auto-fiH mode (value upon reset) 
I=auto-fitl cache the coprocessorting at the 
address specified indcc_addr. Cache is validated 
at the end of the operation. This bit is setf-clcar- 
ing. 


2 


Lock 


0=data is written to cache RAM after data fetch 
(value upon reset) 

1-data is not written to cache RAM after data 
fetch 


3 


always_hit 


(WTag memory and valid bits are checked (value 
upon reset) 

l=Data is assumed to be always valid and resident 
in cache 


4 


always__rniss 


0=Tag memory and valid bits are checked (value 
upon reset) 

1 «Data is assumed to be not resident in cache 
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Table 1 .72 dcc_cfg2 Register Fields 



iliilill 




m I -Definition 


[10:8] 


mode 


current mode of caching operation: 

0= random access- normal mode (value upon " 

reset) 
1-JPEG Encoding 
2=s!ow JPEG Decoding 
3 = sin°If* cluinnp] colour <;njipp rAn\»/*rcinn 

4=mu!ti channel colour space conversion 
5= matrix multi plication 
6=64 bit mode 

7=Invalidare all entries in cache. The DCC will 
revert to mode 1 when alt the "line valid" bits 
have been cleared. 

This field is reset at the end of each instruction. 


11 


cache_miss_inst 


current mode of caching operation: 

0=Tag memory and valid bits are checked (value 
upon reset) 

1-Data is assumed to be not resident in cache 
This bit is reset at the end of each instruction. 


bt. dcc_stat 

i 

dcc_stat Register Fields 






^HHBKU^S^: ill < : i \ 


[3:0] 


error[3:0] 


error condition: 

bit 0=attempted R/W operation to cache or Tag 
memory with the module disabled. 


[p:ql 


state[7:0] 


internal state condition. Details to be determined 


bu. dcc_erMnt 
dcc_errjnt Register Fields 






■4 ; ; » « m t m DfiMltiOrV < g m \ : & u 


0 


error 


attempted R/W operation to cache or Tag mem- 
ory with the module disabled. 
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bv. dcc_errjnt_en 
Table 1.75 dcc_err_im_en Register Fields 



Reld(tjils) , 


• ' ; Name ['■,," 


. . - - ' Definition > 


0 


error_en 


enable corresponding error 


bw, dccJvO 

dccJvO Register Fields 








&m> > - Definition tsi * - . 


(31:0| . 


tv0[31:0] 


valid bits for lines 31 down to 0 



bx. dec hM 



Table 1.77 dccjvl Register Fields 





~ ' TI -i" ' * Definition' - ' ' ~ - 


(31:0) | lv 1(3 1:0) 


valid bits for lines 63 down to 32 



by. dccjv2 



Table 1.78 dccjv2 Register Fields 



F*oldfoit$}_ - v ^ Name — -- 






: : •:•:•>: 'X-v 




(31:0] | lv2[31:OJ 


valid bits for lines 95 down to 64 



bz. dec Iv3 



Table 1.79 dcc_lv3 Register Fields 







MB | \ £■> Definition | ; • j 


[31:0] 


lv3[31;0] 


valid bits for lines 127 down to 96 
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ca. dccaddr 
Table 1.80 dcc_addr Register Fields 









131:0] 


dcc_addr 


Thi s is the base address register used for Uie spe< 
cial addressing modes. Note that different bits of 
this register are used depending on lhc opera- 
uonal mode. 



cb. dcc_raddrb 



Table 1.81 dcc_raddrb Register Fields 









[31:0] 


dccjraddrb 


This is the sutus of Operand Organiser B's 
address 



cc. decjraddre 



Table 1.82 dcc_raddrc Register Fields 









[31:01 


dcc_raddrc 


This is the status of Operand Organiser C's 
Address 



cd. dec Jest 



Table 1 .S3 dec Jest Register Fields 



W3311 




0 


drive_odata 


Tliis bit is only valid when the test_mode bits are 
seuoO. 

0=do not drive Operand Organiser data busses 
(value upon reset) 

l*Ocive Operand Orgajuser data busses 




o_ackb 


This bit is only valid when the test_rnodc bits arc 
set too. 

This bit is only useable when the DCC is in ils 
test mode and die drive_odata bit is set. This bit is 
self-clearing. 

0=o"o not assr.n o_ackb (value upon reset) 
1 --assert o_ackb for one elk cycle 
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Table 1.83 



dccjesi Register Fields 







^ , * uejinjuon s ; f«. 


2 


o_ackc 


This bii is only valid when ihe tesLmode bits arc 
set to o. 

This bit is only useable when the DCC is in its 
test mode and the drive^odaia bit is set. This bit is 
self- clearing. 

0=do not assert o_ackc (value upon reset) 
l=assert o_ackc for one elk cycle 


[5:3) 


iest_mode 


0=drive odata busses rnode(value upon reset) 
l=drive OOB address out to IIS, i_oe=0 
2=drive OOC address out to 11$, i_ Q e=0 
3=drive IIS data to o_datab f 31 : 0 ] , i_oe=t 
4-drive IIS data to o_datac f 31 : 0 } . i_oe=l 



Operand Organiser Registers 

Note that there are two identical Operand Organisers: Operand Organiser B and Operand 
Organiser C. The registers for these two Operand Organisers (which are identical) are 
described here. 



ce. oonj?fg (oob_cfg=0x70, ooc_cfgr0x80) 
Table 1.84 oon_ctg Register Fields 





llHiii III 


< - ^- - . , l 1 t .J > - ? „ „ 1 ■ 

.Description ^ K 


Pl:9] 




Reserved 


[8] 


operate 


the coprocessor tit for 00 :(self-cleared at end of each 

instruction) 

0«ldle 

1 = operate 


(7:2] 




Reserved 


[1] 


E 


0 = disabled 

1 - enabled 


(0] 




Reserved 
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cl. oon_stat (oob_cfg=0x71, ooc_cTg=0x81) 
Table 1.85 oon_stat Register Fields 





• - 


» - ,i , v,, ♦ ; ■ " . " , ' 




. Nnme'-r 


.. ;f: , r; ;0escrjp{j0n 


[31:24] 




Reserved 


[23:16] 


state[7:0] 


internal state condition : 
bit 0=00 active 

bit 1 - OO stalled (sequential mode only) 
bit 2 m FIFO empty 
bit 3 s FIFO full 


C15:0] 




Reserved 



eg. oon_errjnt(oob_err_mt=0x72 l ooc_err_int=Dx82) 



Table 1,86 oon_err_int Register Fields 



^ Field ^ 






[15:01 


error 


error condition r 

bit 0= OBus request received by OO while disabled 

bit 1 = MDP and JPEG Coder requesting data simultaneously 



ch. oon_errJnt_en (oob_errjnt_en=0x73, ooc_err_int_en=0x83) 



Table 1.87 oon_errjnt_en Register Fields 



tfilll^ppil 






(15:0] 


error 


error condition : 

bit 0=* OBus request received by OO while disabled 
hit 1~ MDP and JPEG Coder requesting data simultane- 
ously 
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ci. oonjmr [oob_dmr=0x74, ooc_dmr=0x84) 
Table 1 .88 oon_dmr Register Fields 





: v.-': :--.':.-.-..'.^.w.-.vi^xi ; '-v v 




(31:30] 


ls3 


lane swap for byte 3 : 

0 = from byte 0 

1 = from byte 1 

2 = from byte 2 

3 = no swap 




ls2 


lane swap for byte 2 : 

0 o from byte 0 

1 - from byte 1 

2 - no swap 

3 » from byte 3 


[27:16] 


hi 


lane swap for byte 1 : 

0 = from byte 0 

1 *= no swap 

2 = from byte 2 

3 from byte 3 


i 


IsO 


lane swap for byte 0 : 

0 -» no swap 

1 = from byte 1 

2 = from byte 2 

3 - from byte 3 


123:20] 


suben[3:0J 


Substitution enables : 

0 - do not perform substitution operation on this byic 

1 u^c Yaiuc stoxea in oo/i_sudsi for this byte 


[19:15} 


replicate 


Rpfllirflfirkn /""7m mr • i 

indicates number of additional data items to seneraie 


[14:12] 


normalise 


Normalisation factor : 
0=1 bit per channel 
1 » 2 bits per channel 

2 = 4 bits per channel 

3 o 8 bits per channel 

4 to 7 = 1 6 bits per channel 


[U] 




Reserved 


{10:8} 


bo[2:0j 


Bit offset within byte of bit object 1 


(7) * 


P 


External Format : 

0 input objects are unpacked bytes 

1 = input objects are a packed stream 


16:5] 


if 


Internal Format : 
0^ pixels 

1 = unpacked bytes 

2 = packed bytes 

3 «■ other 
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Table i .00 oonjmr Register Fields 







/ a ^pocnpiioiv • f / ' ' 




F4:3) 


cc 


Input Channel Count : 

0 = 4 active channels 

1 » I active channels 

2 = 2 active channels 

3 = 3 active channels 




[2] 


L 


0 = immediate data in oo_said register 

1 = direct addressing 




f 1:0] 


what 


Addressing Mode : 

0 «= bypass 

1 » sequential 

2 = tiling 

3 constant data 





cj. oon_lsubst (oob_subst=0x75, ooc_subst=0x65) 



Table 1.89 oon^subst Register Fields 





ttNOffltal - i HHffi III 


[31:0] J 


subst | vaJue of substitution data 



ck. oo/Kcetp (oob_cdp=0x76, ooc_cdpt:0x86) 



Table 1.90 oon^cdp Register Fields 









(31:0] 


cdp 


current data pointer 



el. oonjen (oobjen=0x77, oocjcn=0x87) 



Table 1.91 oonjen Register Fields 





■ -r y -t\ t ~ '1; : 




[31:0] 


len 


length of input operand stream 


cm. oon_said (oob_s3id=0x78, ooc_said=0x88) 


oon_said Register Fields 






••••••• • »:<*»•. 'v.- 


li 1 ' fje^enption ; 


[31:0] 


said | 


ihc coprocessor! address a imrnedi ate data " 
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cn. oonjile (oobji!c=0x79, ooc_ttlei:0x89) 



Table 1.93 oon_iile Register Relds 







\ :,::1L ; Description - . . 


j [3l:l6J 


orfset[15:0] 


offstii irxo operand ~ 


[15:0) 


tilejen(l5:0) 


length- 1 , m bytes, of operand for tiling operation 



Pixel Organiser Registers 



co. po_cfg 
Table 1 .94 po_cf g Register Relds 









[26:241 


burst[2:0] 


iog case 2 ot maximum burst length for input data. Data ~ 
bursts ane aligned to addresses of this value. 


[22:21] 


char. 


Number of input channels for JPEG compression : 
0 . 1 = Single channel 
2 = 3 channels 
3-4 channels 


(20] 


SS 


0 - no subsampliiig 

1 =■ perform subsampling 


119] 


F 


Filler option for subsampli ng : 

0 = no meeting (use every second byte) 

1 ~ apply filtering (average) 


[18:16] 




Reserved 


[15] 


dst 


destination foe PBus data : 
0 = MDP 
1=JPEG Coder 


[14:12] 


irKxJe(2:0) 


Operating mode for PO: 
0-idle 

1 - sequential mode 

2 « JPEG Compression 

3 = CSC 

4 to 7= Convolution'Imagc Transformation j 


[11:4] . 




Reserved 


13J 


S 


0-not in self test mode 
I =» self test mode 


[2J 


B 


0 = not bypassed 

1 = bypassed 


[1J 


E 


0=dtsabfed 
\ «= enabled 


[0] 




Reserved 
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505 

cp. po^siat 



Table 1.95 po_stat Register Fields 



' Field ' ' 


; Na'm6 ; - 


•x - ? x V : ' ' ■ Description <_ > ;■ - / . , ■ <*■ 


(31:27] 




Reserved 


[26) 


muv_miss 


0- no MUV miss 
! = MUV miss 


125) 


po_stall 


0 = PO not stalled 

1 = current PO operation stalled 


[24] 


po_active 


0 = PO not active 
l~PO active 


[20] 


sni_the copro 
ccssort 


PO the coprocessor! state 


[19] 


ack_rcvd 


ibus ack received 


[18] 


ibu5_req_madc 


ibus req made 


[17] 


fifo Jul! 


FIFO full 


[16] 


fifo_crnpty 


FIFO empty 


[15:0) 




Reserved 


cq. po_err_int 
po_er_int Register Fields 


<&*> • 7" 






[7:01 


error 


error condition : 

bit 0 = IB us i_ack asserted to PO while disabled 
bit 1 = both PO and RO using MUV RAM simultane- 
ously 

bit 2 - jpeg mode data overflow 


cr. po_err_ir 
po_erMnt_er 


it_en 

\ Register Fields 










err_niask 


error mask : 

bit 0 - IBus Lack asserted to PO while disabled 
bit 1 = both PO and RO using MUV RAM simultane- 
ously 

bit 2 - jpeg mode data overflow 
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cs * po„drnr 



1 .98 po_dmr Register Fietds 



• : ::: Field.-- V- 






; *V . - * Description — : 


(3 1 :30] 


Is3 


lane swap for byte 3 : 

0 = from byte 0 

1 = from byte 1 

2 = from byte 2 

3 = no swap 


(29:28J 


IS2 


lane swap For byte 2 : 

0 = from byte 0 

1 = from byte I ; 

2 = no swap 

3 « from byie 3 


[27:26] 




lane swap for byte I : 

0 - from byte 0 

1 - no swap 

2 = from byte 2 

3 - from byte 3 


[25:24] 


IsO 


lane swap for byte 0 : 
0 = no swap 
1= from byte 1 

2 = from byte 2 

3 » from byte 3 


[23:20] 


suben(3:0] 


Byte Substitution Enables : 

0 = do not substitute data from po_subst register for 
mis Dyte 

1 = use po_subst register value for this byte 


[19:15] 


rep!icate[4:0] 


replication Lount : 

indicates number of additional imvml data items to 
generate 


(14:12] 


nornLfactor[2 


Normalisation factor for input data : 

0 - 1 bit per component 

1 =» 2 bits per component 

2 ~ 4 bits per component 

3 = 8 bits per component 

4 to 7 * 16 bits per component 


[111 




Reserved ~ 


[10:8] 


bo[2:0] 


bit offset within byte; 

Bit addressing is big endian 


P] 


P 


External format : 

0 = input data consists of unpacked bytes 
1 « input consists of a packed stream 
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Table 1.98 po_dmr Register Fields 







! v^U: ' x . , Description * , 


(6:5) 


1 if[I:0] 


Internal format of data na^prf id nnrnur * 

0 = pixels 

1 a unpacked bytes 

2 = packed bytes 

3 - other 




(4:3) 


cc(l:0] 


Channel Count for input stream : 

0 = 4 active channels 

1 = 1 active channel 
2-2 acti vc c hannels 
3=3 active channels 


— 


(2) 


L 


0 = immediate operand ("short" format) 

1 direct addressing ("long" format) 




(1:0) 


whai[l:0] 


DMU addressing mode : 

0 = bypass 

1 « sequential 
2= tiling 

3 « constant 





ct po_subst 



Table 1 .99 po_subst Register Fields 







Mil 




-v.* .>v- & r" £ ^ ••• • 


31:0 


subst[3t:0] 


value of substitution data 


cu. po_cdp 










po_cdp Register Fields 
















, - . - » - u ' . 


31.0 


Cdp[31:0] 


address of current data item 



cv. pojen 



Table 1.101 pojen Register Fields 



Field ' 

• 


' | Nlatne • f&. 




31:0 


lcn[31:0] 


length of the operand 
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cw. po_said 



Tabic 1.102 po_said Register Fields 



; s: Field ; ; 


- --r.Nsrne-,' *' 


♦ : 1 /- Descriptions, , ' : • • 


[31:0] 


ad 


the coproccsjon address of operand data (or immediate 
data value) 



ex. pojdr 



Table 1.103 pojdr Register Fields 



mmam 






[31:0] 


width[31:16] 
hdghr[15:0] 


width • I , in pixels, of current image 
height - 1 , in lines, of current imase 


cy. pomu 
1 po_jnuv_ve 


v_ya\\d 

lid Register Fields 








[31:0] 


valid 


V alid bi is for MUV lines 



czl po_,muv 



Table 1.105 pojnuv Register Fields 











[31:111 


mtivjjasc_addt 


base address of MUV RAM 



Main Data Path Registers 



da. . mdp^cfg 

AH bits are reset to 0. 

Table 1.106 mdp_cfg Register Fields 











i 


E 


0 - disabled 

1 = enabled 




2 


8 


0=noc bypassed 
\ =bypassed 




3 j 


T 


0«not in self lest mode 
I -self icst mode 
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Table 1.106 mdp dg Register Fields 





•j', Name'\, 




14 


decode 


0 = Instruction decoding disabled 

1 = instruction decoding enabled: mdp_opl and mdp op2 
set up from radp_cfg (self clearing) 


15 


operate 


die coprocessor! bit (self clearing) 


17 


word input 


0 - horizontal interpolation with byte inputs 

1 — iiuii/jjuum nii.ci|AjidLiuu wnn woro input 


18 


long_krnl 


formal of kemaj discriptot 

0 - short format 

1 - long format 


19 


blend_gen 


0 - blend generation not required 

1 blend generation required 


31:20 


Instruct 
tion[15:OJ 


Instruction major and minor opcodes 



db. mdp_stat 

All bits are reset to zero 

Table 1.107 mdp_stat Register Fields 





1 Nam a 1 ~* ^srSffA-- - "1' " 


0 


po_valid 


status of *po_vaJid' sigral 




po_final 


status of 'poJinaT signal 


2 


po_stali 


status of 'po^stair signal 


3 


oob_vaiid 


status of *oob_ack* signal 


4 


oob_req 


status of 'oob^req* signal 


5 


oob _pendin 

g 


0 « no outstanding requests in OOB interface 
J *=■ outstanding requests in OOB interface 


6 


ooc_valid 


status of *ooc_ack* signal 


7 


ooc_req 


status Of *ooc_rcq* signal 


8 


ooc_j>endin 

g 


0 = no outstanding requests in OOC interface 

1 - outstanding requests in OOC interface 


9 


ro_valid 


status of 'rcuvalid* signal 


10 


rojinal 


status of 'roJlnaT signal 


11 


ro_stall 


status of l io_scair signal 


12:13 . 


matmul slat 
e[1:0] 


matrix multiplication state 

0 - idle 

1 = multiplying the left half of matrix 
2 -multiplying the right half of matrix 


14:15 


int_stat(1:0] 


inteipclaLionby fixed proportion siates: 

0 = cycle 0 

1 = cycle 1 

2 = cycle 2 

3 - cycle 3 
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Table 1.107 rndp_slat Register Fields 



IlBiill 


mm-; S 


• ^ -Description ' : " ',< Tr l?Z77 


16 


jump 


ramp generator mode (in Wend generation) 

0 = step mode 

1 » jump mode 


17:18 


addgen stat 
e[l:0] * 


state or the Address Generation state machine " 
0 = idle 

2 = generating address 


19:20 


ramp_state 


state of rampgeneration state machine 

0 = idle 

1 - determine mode of operation 

2 = generating blend 



dc. mdp_errjnt 

All bits are reset to zero. 



Table 1.108 mdp_errjnt Register Fields 



: Field m 
7:0 


error[7:0] 


enorfO] = receive data from PO when disabled or MDP is 
not the coprocessoned 

errorl i ] - receive data from OOB when disabled or MDP 
is not the coprocessoned 

error(2] = receive data from OOC when disabled or MDP 
is not the coprocessorted 


24:8 


excep- 
tion{15:0) 


excepuon(0] = underflow or overflow in damping in 
channel 0 ° 

exception[l] - underflow or overflow in cl ampin* in 
channel 1 & 

exception^] ~ underflow or overflow in damnino in 
channel 2 

exception[3J - underflow or overflow in damping in 
channel 3 F B 

exception^] - x co-ordinate underflow (image transfor- 
mations and convolutions) 

exception[5J - y co-ordinate underflow (image transfor- 
mations and convoluuons) 
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2V 

dd. mdp_erMnl_en 
All bits are reset to zero. 
Table 1.109 mdp_err_int_j?n Register Fields 



r RefcJ ? > . 








7:0 


err mask[7:0 
] 


mask error condition 

0 -= masked 

1 - not masked 

err_rnask[0] = receive data from PO when disabled or 

MDP is not the cop« ocessoned 

err_mask(l) « receive data from OOB when disabled or 

MDP is not the coprccessorted 

ca_mask[2) « receive data from OOC when" disabled or 

MDP is not l\k coproccssorted 


24:8 


exp mask[i 
5:0] 


mask exception 
0- masked 
I *- not masked 

exp_mask[OJ = underflow or oveiflow in clampino | n 
channel 0 r a 

exp_mask[ 1 J = underflow oi overflew in clamnin* in 
channel 1 

exp_mask[2] = underflow or overflow in damping in 
channel 2 

exp_mask[3] - underflow or overflow in clamping in 
channel 3 

exp_mask:[4] = x co-ordinate underflow Gmase transfor- 
mations and con vol uU ons) 

exp_mask[5] - y co-ordinate underflow Cima.ee transfor- 
mations and convolutions) 



de. mdp_test 
AH bits are reset toO 

Table 1.110 mdp_test Register Fields 









3:0 


test_dataf3:0] 


The source of data to RO under test mode 

0-3 - RO data is from PO data (setting to 0. 1 . 2 or 3 

would have the same effect) 

4 - RO data is from oob_data[3 1 :0) 

5 - RO data is from oab_data[63;32} 

6 = RO data is from oob_data[95:64] 
7- RO data is from oob_data[l 27:96] 
8 = RO data is from ooc_data[3l:0) 
°- RO data is from ooc_data[63;32] 

10 - RO data is from ooc_data [95:64] 

11 =R0 data is from ooc_data[ 127:961 
12-15 « reserved 


25:4 


resQrved 
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Table 1.110 mdpjest Register Fields 



: - .: Field />■-■. U'\ 






29:26 


output delayf 
3:0] 


the number Of Clock CVCles between inrmt arvl rtnfTiiir *» 


31:30 


skeletal mod 
e[1:Q] 


mode of skeletal MDP 

0 - PO to RO with some delay 

1 = PO -> OOB -> OOC -> RO with some delay 

2 = me least significant byte of the data from PO, OOB 
and OOC are combined to give the result word to RO. 

3 = reserved 



df. rndp_op1 

All bits are reset to zero 

Table 1.111 mdp_op1 Register Relds 







mmmmmmam i man 


2:0 


ppb modeA[2 
:0J 


mode of multi-function block A in Reprocessing Block 


5:3 


ppb_modeB[2 
:0] 


mode of mulu -function block B in Preprocessing Block 


8:6 


pba modeA[2 
:0] 


mode of multi-function block A in Sta*c A Processing 
Block 


11:9 


pba_modeB[2 
:0] 


mode of multi-function block B in Stage A Processing 
Block 


14:12 


pba modeC{2 
:0] 


mode of multi-function block C in Stage A Processing 
Block 


17:15 


pbb modeA[2 
:0] 


mode of multi-function block A in Stase B Processing 1 
Block 


20:18 


pbb_modeB[2 
:0) 


mode of multi-function block B in Stage B Processing 
Block 


23:21 


pbb modeC[2 
:0] 


mode of multi-function block C in Stage B Processing 
Block 
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Table 1.111 mdp_opi Register Fields 





iuni 


r ■* • • Desert pnon < <c * • : 


27:24 


inst.type(3:0] 


Type of insrnic/Jon: 

0 = no_op (which implies thai (lie MDP does not for that 
instruction) 

1 - GCSC 

2 = reserved 

3 = Matrix Multiplication 

4 = Convolutions 

5 = Image Transformation 
Preserved 

7 -Half Toning 

8 « Residual merging 

9 - bypass (which implies that MDP only needs to pass 
data from PO to RO) 

10 =» Horizontal interpolation 

1 1 = Vertical Interpolation 
12-13 «corapositing 
14-15 * reserved 


28 


Itjnt 


Interpolation required m Image transformation 
0-no interpolation 
1 = interpolation 


29 


iLacc 


Accumulation required in Image transformation 

I l\f aL-L UXJlUiaXlUll 

1 = accumulation 


30 


comp_unpre 


Uihpre -multiplication required in Compositing 
0-no un-pre-multiplicatioo 
1 « un-pre-mulilpiication 


31 


comp_blend 


Blend generation required in compositing 

0 « no blend generation 

1 = blend generation 



<Jg* mdp_op2 

All bits arc reset to rero. 



Table 1.112 mdp_op2 Register Fields 







*v •^^-siiiii:' y ■•• - -. - 


1:0 


mul_A[1:0] 


multiplicand of Operand A pixel 

0 = 0 

1 = 1 

2 = opacity B 

3 = -opacityB 


3:2 


muLB[1:0| 


multiplicand of Operand B pixel 
0-0 

1 = 1 

2 » opaciiyA 

3 = -opacityA 
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Tabic 1.112 mdp_op2 Regisier Fields 





■ I -Nam ■ 


: — ^ .;-;-< Description " — * - ; . 


4 


reverse 


0 = do not reverse operand in compositing 

1 - reverse operand in compositing 


5 


addgen_mod 
e 


Address generation mode 

0 « ima«e transformation mode 

1 = convolution mode 


6 


selLcfg 


0 = long kernel descriptor 

1 - short kerne) descriptor, other parameters are seL r configured 


7 


reserved 




o 
o 


ag__the 
caprocessort 


address geneiadon the coprocessort bit It is cleared-when the 
address generation is finished 


9 


bgjhe 
coprocessort 


blend generation the coprocessort bit. It is cleared when the blend 
generation is firtished. 


10 


matjhe 
coprocessort 


matnx multiplication tlx: coprocessor! t>iL it the coprocessorts the 
matrix multiplication state machine inside the MDPIL It is cleared 
by that state machine when it sees the P o_f inal signal asserted. 
(Only valid for Matrix Multiplication instruction.) 


11 


int_thc 
coprocessort 


interpolator! the coprocessor! bit. It the coproccssoas the interpo- 
lation state machine inside MDPIL Once the coproces sorted, this 
tat is set until the state machine sees t>o final k a«»iti»H sx M i.. 
valid for horizontal and interpolation instruction) 


12 


int_size 


0 = interpolate with packed bytes 

1 - interpolate wi dt unpacked bytes or pixels 
(only valid forhori2ontal and vertical interpolation) 


13 


int_4 


0 = interpolate by a factor of 2 

1 = interpolate bv a factor of 4 

(only valid for horizontal and vertical interpolation) 


14 


int_vertical 


0 = horizontal interpolation 

1 =s vertical interpolation 


15 


fock_step 


0 « the PO, OOB and OOC interfaces are operating independent of 
each other 

1 = the PO, OOB and OOC interfaces are locked together, so they 
will only accept data together. 


19:16 


reserved 




21:20 


cw config[l: 
0} 


CI am p. or- wrapper configuration 

0 = wrapped and no absolute value 

1 - wrapped and absolute value 

2 = clamp (overflow to OxFF, underflow to 0x00), but no absolute 
value 






3 « absolute value and clamo 


22 


fr_en 


Fraction Rounder configuration 

0 - disabled (returns 0) 

1 ^enabled 
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Table 1.112 rndp_op2 Register Fields 







r ' - • : / '..Description ; : 


24:23 


oob mode[1: 
0] 


Mode of OOB interface operation: 

0 - disabled 

1 = sequential 

2 = random 

3 = blend_generation 


26:25 


ooc model 1: 
0] 


Mode of OOC interface operation: 

0 = disabled 

1 = sequential 

2 = CCSC 

3 = pixel 


30:27 


trans(3:0] 


In Compositing operation 

0 = do not subtract offset for this channel 

1 =» subtract offset for this channel 

In Colour Space Conversion operation 

0 = do not apply translation and clamping to output value on this 
channel 

1 = use uiuislatioi] and clamping on this output channel 
In Image Transformation or convolution operations 

0 = initialise accumulator to 0 for this channel 

1 « initialise accumulator to rndp_poi:O000 for this channel 


31 




reserved 



dh. mdp_j5or 

All bits are reset to zero. 

Table 1.113 rndp_por Register Fields 







~ - ~ t | :J ; * H^cripfi □ n * - * * *■* ~ ' * 


[7:0) 


ofTsetO[7:0] 


offset for plus operator on channel 0 


[15:8| 


offset 1 [7:0] 


offset for plus operator on channel I 


[23:16J 


ofTseL2[7:0} 


offset for plus operator on channel 2 


[31:241 


offsets [7:0] 


offset for plus operator on channel 3 


[31:0] 


orTset[31:0) 


offset for convolutions and transformau" ons 



di. mdp_bi 

All bits are reset to zero. The mdp„bl register is used for different things in different modes: 



Table 1.114 mdp_bi Register Fields (compositing mode) 





gf Name' ; - 


* :t: — ' Oescrijption, ^ ■ ■ y.. r 


[7:01 


blendendO . 


the coprocesson value of blend on channel 0 


[15:8) 


Wendend 1 


the coprocessor! value of blend on channel 1 
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Table 1.114 mdp_bi Register Fields (compositing mode) 



•Field. 




Description '• 


[23:16] 


Wendend2 


tlie coprocesson value of blend on channel 2 


[31:24] 


bletKknd3 


the coprocesson value of blend on channel 3 



Table 1,115 mdp_bi Register Fields [non-compositing mode) 





;-P Name ■'<:.- 


■ • - - . Description ' • ' -- - * - - 


[31:2] 


i offset 


offset imo the index table 



dj. mdp_bm 

All bits are reset to 7ero. The mdp_bm register is used for different things in different 
modes: 



Table 1.1 16 mdp_bm Register Fields (compositing mode) 









[7:0] 


blendendO 


end value of blend on channel 0 


[15:8] 


blendend) 


end value of blend on channel 1 


[23:16] 


blendend2 


end value of blend on channel 2 


131:24J 


biendencB 


end value of blend on channel 3 



Table 1.117 mdp_bm Register Fields (non-compositing mode) 









[3:0] 


rows[3:0] 


number of rows in the matrix 


[7:4] 


co!s[3:0) 


number of columns in the matrix 


[15:8] 


level[7:0] 


number of levels in halftoning 


[20:16] 


bp[4:0) 


location of binary point 



dk. mdpjen 

All bits are reset to zero. 

Table 1.118 mdpjen Register Fields 







31:0 


length 


length of blend to be produced 
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2lfi 

JPEG Coder Registers 
dl. jc_cfg 

Table 1.119 jc_cfg Register Fields tor JPEG instructions 



Field (Bits) j-Name 


- - - . , w~i 4 v r ~ - ' < 


[31:28] 


mop 


Major opcode from instruciion - 00 1 0 * 


(27J 


D 


0 JPEG compress 

1 = JPEG decompress 


(26] 


M 


0 ~ single colour channel 

1 = multiple colour channels 


[25] 


4 


0 = three channel 

1 » four channel 


[24] 


S 


0 = do not use subsampling regime 

1 = use subsarnpluig regime 


[23] 




reserved 


[22] 


H 


0 » use fast huffman algorithm 

1 «= use slow huffman algorithm 


[21:16] 1 


reserved 


[15] 


0 


0 = JCisnot operational 

1 - JC is operational 


[14] 


dec 


0 = disable decoding of instruction 

1 - enable decoding of instruction 


[13:8] | 


reserved 


[7] 


A 


0 = align rethe coprocessor* markers to byte bounda- 
ries 

1 - align rethe coprocesson markers to word bounda- 
ries 


[6] 


Z 


0=padwithls 
I - pad with Os 


[5:4] 




reserved 


[3] 


T 


0 = not insclf test mode 
1 -in self test mode 


[2] 


B 


0 = not bypassed 

1 « bypassed 


[1] 


E 


0 = disabled 

1 = enabled 


(0) 




reserved 
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4* 

if • 



6> 



Table 1.120 jc_cfg Register Fields for data coding Instructions 









131:28] 


/nop 


Major opcortc from instruction = 0010 


(27) 


D 


0 - compress 

1 = decompress 






reserved 




ailt 


Difference between ire number of input byies and 
the number of output bytes: 

00 no diffcrrtw 

01 - one extra output byte than input byte 

10 » 1 1 — one less inDtit hvtr than rmfnnr hvtr* 


[23] 


op 


operation: 

0 = huff man 

1 = predictive code 






reserved 


[18:16) 


ibo 


input bit offset 


[15] 


O 


0 JC! i*i not nn^rAlinn-al 

1 ^JCis operational 


[14] 


dec 


0 - disable decoding of instruction 
3 = enable decoding of instruction 


[13:TJ 




reserved 


[6} 


Z 


0 = pad with 1$ 

1 - pad with 0s 


[5:4] 




reserved 


[3] 


T 


0 = not inself test mode 
1 -in self test mode 


[2] 


B 


0 o not bypassed 

1 - bypassed 


[U 


E 


0*= disabled 
I ■ enabled 


[0] 




reserved 



Table 1.121 Jo_cfg Register Fields for memory copy Instructions 









[31:28] 


mop 


Major opcode from instruction - 1001 


[27] 


D 


0 = general purpose data movement 

1 = local DMA 

This bit should always be set to *0 ' 


126] 


. ft 


0 « not a bit copy operation 

1 = bit copy operation 
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Table 1.121 jc_cfg Register Fields for memory copy instructions 



Field (Bits) Name \i 


« ^^> K t Description 7A ^'ty/S 


[25:241 


dirr 


Difference belween ihe number of input bytes and 
00= no difference 

01 — one extra oulDut bvte tlian innnf hvi#» 

10 = n = one less input byte than output byte 






reserved 1 


(22:20) 


obo 


output bit offset 


119] 




reserved 


(18:16) 


ibo 


input bit offset x. 


(15) 


0 


0 «= J C is not operational 

1 «=* jv, j 5 opcrauonai 


(14) 


dec 


0 « disable decoding of instruction 

1 = ptu Till* riprfwllnif nr In cf n i/*fi/%n 


(13:4) 




reserved 


(3) 


T 


0 - not inself test mode 

1 « in self tesi mode 


m 


B 


0 = not bypassed 






1 = bypassed 


(1) 


E 


0- disabled 
1 = enabled . 


(0] 




reserved 



dm. jc_stat 



Table 1.122 jc_stat Register Fields 









(31:8] 


reserved 




(7:0) 


' state 


(o be finalised 



dn. jcj*rr_int 



Table 1.123 jc_err_jnt Register Fields 



mmm. 






[31:19] 




reserved 


[18] 


huffJlLtablc 

5 


illegal huffman table. More than nine huffman tabic 
heap misses occured. 


(17] 


hufF_ilLmpo 

■ s 


illegal marker posidon 


(16] 


hutf_itl_rnar 
ker 


illegal marker 
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250 

Table 1 .123 |c_err Jnt Register Fields 





; i Name t - 


' - "\ ; Description \. /:v •:• 


(15] 


coefMILAC 


illegal AC coefficient value (-1024) during 


[14] 


codT_ilI_DC 


illegal DC value 1 


113) 


cocffJll_AC 
_mag 


illegal AC magnitude category 


[12] 


COeffjU_DC 
_mag 


illegal DC magnitude category 


[HI 


coeffjlLRS 

T 


illegal KST m crxmi value 


[10] 


coeff_oveiflo 
w 


datA overflow detected by cceff coder * 


[9] 


coeflliIi_mp 

OS 


illegal marker position 


[81 


jpeg^undeifi 
ow 


underflow 


[I] 


jpeg^disable 
d 


received data while disabled error 


[0] 


huff_HI_sym 
bol 


illegal huffman symbol error 



do. jc_errjnt.en 



Table 1.124 jc_errjnt_en Register Fields 









[31:19] 




reserved 


(18) 


huffjlljable 
s 


illegal huffman table. More than nine huffman table 
heap misses occured. 


(17] 


hufFJHjmpo 
s 


illegal marker posldon 


[16] 


nuff_ill_rnnr 
ker 


illegal marker 


[15] 


coeffJlLAC 


illegal AC coefficient value (-1024) during 


[14) 


coeffJILDC 


illegal DC value 


U3]. 


coeffjlLAC 
_mag 


illegal AC magnitude category 


(12] 


coeffjlLDC 
_rnag 


illegal DC magnitude category 


HI] 


coefTjlLRS 
T 


illegal RST m count value 


[10] 


coeff_overflo 
w 


data overflow detected by coeff coder 


[9J 


coeffjlLmp 

OS 


illegal marker position 


[8] 


jpe£_underfl 
ow 


underflow 
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I able 1.124 jc_errjnt_cn Register Fields 





Name ' v 




in 


jpeg^disable 
ci 


jeteived data while disabM error 


(0) 


hufT_!ll_sym 
~bol 


illegal huflmaii symbol error 



dp. jc^rsi 



Table 1.125 jc_rsi Register Fields 



IlililSi 


. ^Narne v : :; 




[15:03 


rsi 


number of MCU blocks between rcthe coproccssoit 
markers 



dq. jc^decode 



Table 1.126 jc_decotfe Register Fields 



1PM 


m:-mm ■ . 


: r.-i:. PeicrfritiAn -~ < ' 
^^^^^ r - : ,, r ,u«Bcqpiiop / . , , . . 




dct_enable 


enable da submodule 


II] 


dct_bypass 


put dec submodule into bypass mode 


|2] 


dctjorward 


put dct into forward mode 


m 


qdcLenablc 


enable quantizer submorfuJe 




qdq_bypass 


bypass quantizer submodule 


[5] 


qdq_forward 


put quantizer into forward mode 


IQ 


qdq_four 


four channel image 


PI 


qdo^ubsmpi 


subsarnpled image 


[81 


cc_enable 


enable the eoefF coder submodule 


[9J 


ccj>ypass 


bypass coeff coder submodule 


00) 


cc_forwairJ 


put coeff coder into forward mode 


[111 




code Jpeg compliant stream 


02) 


cc_subsmpl 




03) 


ccjburchannel 




04) 


cc^muitichannel 




[15} 


hc_cnable 


enable Huffman coder submodule 


[161 


hOypass 


bypass luiffman coder 


0V) 


hc_Forward 


put huff man coder into forward mode 


08) 


hc_su tempi 


subsampled image 


[19] 


hcjast 


perform fast huffrnan coding 


[20] 


hcjpcg 


perform jpeg compliant huffrnan coding 


[21] 


hc_four 


four channel ima«e 
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Table 1.126 jc_decode Register Fields 



' Flfeld ^ 


.?N*rne : 


- « • ' s : ■ ■ , f ru 

'; , : .description' . - 


(221 


hc_align 


align RST m markers on word boundaries 


|23] 


he., zero pad 


0«= pad using * I *s 
i = pad using 'O's 


[2«] 


r^memcopy 


perform memory copy operation 


1251 


-iscjorwrd 




126) 


qdq_multi 




127) 


hc_mu)ti 




[31:221 




reserved 



dr. jc_res 



Table 1 .127 [c_res Register Fields 





■•Narne : 


I • IE 






(7:0j 


res 


Residual value 



ds. jcjable^sel 



Table 1 .128 jc__tab!e_sel Register Fields 



wmum 






[t3:&] 


jc_table_$el_ 
quant 


decoded table selection for quantisation 


{5:0] 


jcjaHc sel 
huff 


decoded tabic selection for huffman decoding 



Result Organiser Registers 



dt. ro_cfg 
Table 1.129 ro_cfg Register Fields 





M 






[31:231 


reserved 




[22:21) 


chart 


JPEG decompressed output data format : 
0, 1 = Single-channel 

2 = 3 -channel 

3 - 4-channel 


[20] 


upsimple 


0 =s no upsarnpling 

1 = upsamplc data from MUV RAM 
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Table 1.129 ro^cfg Register Fields 



Field-: 


•: Name' ; • 


: \; . A\ ''Description : >' ; • V- •• ^"V : f : : v:,.;7;. : H- 


[19:18] | 


reserved 




[17] 


use_cui 


0=do not use value in cut register to cut ouiput data. 

l=use value in cut register to cut output data. 

This bit is reset back to 0 on completion of an instruction. 


[16] 


use_lirnit 


0=dO not use value in limrl rPOiCUM' fn limit nntrmi Hain 

l^use value in limit register to limit output data. 

This bit is reset back to 0 on completion of an instruction. 


(15:14) 




Reserved 


[13:12) 


mode[l:0] 


Current mode of operation: 

0 « idle 

1 = sequential 

2 « JPEG decompression 

3 = CBus data 






Reserved 


[3] 


S 


0 «= not in self test mode 

1 = self test mode 


[2] 


B 


0 = not bypassed 

1 = bypassed 


(1) 


E 


0 =■ disabled 

1 - enabled 


[0] 




Reserved 



du, ro stat 



Table 1.130 ro_stat Register Fields 







mm. * - iisteip i \ 


[31:24) 




Reserved 


{23:16) 


stace[7:0] 


internal state condition : 
bit 0 = RO stalled 
bit I =RO active 
bit 2 = FIFO full 
bit 3 « FIFO empty 


[15:0] 




Reserved 
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dv. ro_err_int 
fable Lt3t ro^errjnt Register Fields 



Field "r 


i Name. 


1:1 ^ O . Description 


17:0J 


error 


error condition : 

bit 0 - lequcsc when disabled 

bit 1 = toth MDP and JPEG Coder active at same time 

bit 2~ illegal address error 

bit 3 - jpeg mode data overflow 



rJvv. to err int en 



Table 1.132 ro__err_int_en Register Fields 




dx. ro drnr 



Table 1.133 ro_dmr Register Fields 





•Name 




[31:301 




lane swap for byie 3 : 
0- from byte 0 

1 = from byte 1 

2 = from byte 2 

3 « no swap 


[29:26] 


152 


lane swap for byte 2 : 
0- from byte 0 
1 « from byte 1 

2 - no swap 

3 * from byie 3 


[27:26] 


Isl 


lane swap for byte I : 

0 « from byte 0 

1 = no swap 

2 - from byte 2 

3 = from byte 3 


(25:24] 


IsO 


lane swap for byteO: 

0 ^ no swap 

1 * from byte I 

2 = from byte 2 

3 = from byie 3 
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Table 1.133 ro_dmr Register Fields 



Table 1.134 



Fifttd 


•->■ Name-T'-I? 

Y^:::::-^■■'^■^:■.-;^■■::-x«-■'-■^'•■ l ;S 


' \ \ ' SiS?^ • '' : ; De jsc rlpli 6 n : ; % . ... : pi 


(23:20) 


suben[3:0] 


Substitution Enables : ~~ " 

0 - do perform substitution for Uii s byle 

1 — use value sronpd in rr* siihct fnrrhic hi»r« 


[19:16] 


wrmask 


Write Masks : 

0 = write out corresponding byte channel 

1 = do nor write out corresponding byte channel 


[15} 


cmbs 


Choose most significant bits 

0=choose least significant bits of a byte when per- 
formin° dcnormali^flMrm 

1= choose most significant bits of a hvt? wbm tv»r- 
forming denorraaJisation 


[14:12] 


normalise 


Denormalisation factor ; 

0 = 1 bit data objects 

1 «2 bit data objects 

2 = 4 bit daia objects 

3 = 8 bit data objects 

4 to 7 - 1 6 bit data objects 


IU:8] 




Reserved 


[7] 


P 


External Format ; 

0 * unpacked bytes 

1 ~ packed stream 


[6:5] 


if 


IntemaJ Format : 

0 = pixels 

t = unpacked bytes 

2 = packed bytes 

3 = other 


[4:3] 


cc 


Channel Count : 

0 = 4 active channels 

1 ts l active channels 

2 = 2 active channels 

3 = 3 active channels 


[2:0] 




Reserved 


f. ro_subst 

>_subst Register Fields 








[31:0] 


subst[3l:0] 


substitution value or data value for Cbus mode 
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d*. ro_cdp 



Table 1.135 ro_cdp Register Fields 



m mm 


Name 




(31:01 


cdpf31;0) 


address of current data item 



ea. ro len 



Table 1.136 ro len Register Fields 









P1:0] 


1fcn[31:0] 


Output Byte count 



eb. ro_sa 



Table 1.137 ro_j&a Register Fields 







[31:0] j sa[3i:0J 


the coprocessort address 



ec, ro_idr 



Table 1.138 rojdr Register Fields 







131:0] 


height(l5:0] 
width[3i:16] 


width - 1 , in pixels, of current image 



ed. ro vbase 



Table 1-139 ro_vbase Register Fields 













[31:12] 


vbase(3i:I2] 


ihc coprocessor Vimiai Address Base 



ee. ro_cur 
Table 1.K0 rojeut Register Fields 







31:0 |cut[31:01 


output cut Offset: throw this many bytes away 
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> 



if 



el- ro Imt 



Table 1.141 rojmt Register Fields 









lnx(31;0) 


lmt[31:0| 


limit to the number of output bytes 



PC! Configuration Space Alias 

PCI configuration space is a 256-byte block of registers defined in the PCI spec, which 
allows the host 10 configure the PCI device, ar.d to read its status. It is accessed using PCI 
configuration cycles. The register conre.nrs arc also mirrored into a read-only area of the 
coprocessor's internal memory space, so that they can be read via norma] PCI bus memory 
cycles. 

The f 01 mat of ilie configuration space implemented in (tie E1C is shown in Mgure 1 .1 



Figure 1.1 the coprocessor PCI Configuration Space Layout 



31 1Q 15 _o 



Device ID 


Vendor iD 


0x00 


Status 


Command 


0x04 


Class Code 


Revision ID 


0x08 


Reserved 


Header Type 


Latency 
Timer 


Cache Line 
Sire 


OxOC 


Base Address 


0x10 


Reserved 


0X14-0X28 


Subsystem ID 


Subsystem Vendor ID 


0x2C 


Reserved 


0x30-0x38 


MaxJ-at 


Min_Gnt 


Interrupt Pin 


Interrupt Line 


0X3C 



Reserved registers, and reserved bits of implemented registers, will return 0 on reads and 
will not be affected by writes. Configuration space addresses in the range 0x40-0xFF are 
also reserved - no vendor specific configuration registers are denned. 

eg. Vendor ID 

This register is read only. CiSRA's Vendor ID is 0x1 1 AC. 

eh. Device ID 

This register is read only, the coprocessor's Device ID is OxOOOl . The Device ID field is split 
up into two S bit fields: the most significant 8 bits is a number characteristic of the device 
(0x0 for the coprocessor) and the least significant 8 bits represents a version number of that 
device (Ox 1 for this version of the coprocessor). 
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ei- Command Register 

The definitions of the fields in the Command register arc shown in Tabic l.!42. 4J1 non- 
reserved bits in this register arc read/write. After reset, this register is set to OxOOOO. 

Table 1.142 the coprocessor PCI Command Register Fields 



Fin!d(btts)j • 




S:?'- ' / Definition r : i -£H 


0 


10 Space 


Reserved. 


1 


Memory Space 


Enables the coprocessor to respond to memory 
cycles. 


2 


Bu5 Master 


Enables the coprocessor to generate cycles on the 
PCI bus. 


3 


Special Cycles 


Reserved. 


4 


Memory Write & 
Invalidate Enable 


Reserved. 


5 


VGA Palette 
Snoop 


Reserved. 


6 


Parity Error Re- 
sponse 


Enables ihe assertion of perr_l on data parity 
errors, and serr_l on address parity errors Of 
bit 8 is also set). Also enables the setting of the 
Parity Error Detected bit in the Status register. 


7 


Wail cycle control 


Reserved. 


8 


serr_L Enable 


Enables the assertion of serr.L the coprocessor 
will only assert serr_l on address parity errors. 


9 


Fast back-to-back 
Enable 


Reserved' 


10-15 


Reserved 


Reserved. 



ej. Status Register 

The definitions of the fields in the Status register are shown in Table hi 43. Reads to this reg- 
ister behave normally. Some bits of this register axe read-only. The other bits can be set to 1 
only by the coprocessor, and can be reset to 0 only by the host (except in test modes). The 
host resets a status bit by writing a 1 to that bit; writing a 0 to any bit has no effect. After 
reset, this register is set to 0x0280, 

Table 1.143 the coprocessor PCI Status Register Fields 



■■Hi 






0-4 


Reserved 


Reserved 


5 


66MHz capable 


Reserved. 


• 6 


User Definable 
Features 


Reserved. 


7 


Fast Back-to- 
Back Capable 


This bit is read-only, and set to 1 . It indicates that 
the coprocessor behaves correctly in a system 
where fast back-to-back cycles are addressed to 
" different target devices. 
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Table 1.143 the coprocessor PCI Status Register Fields 



;E3elcf(bll5)-j Noma,; ;f 




s 


. Data Parity Error 

DftttttGd 


Tliis bit is set whenevei the coprocessor, as a mas- 
ter, detects a dara parity error in reed data, c* 
detects perr_l asserted on a write. It is only sei 
if bit 6 of die command register is set. When this 
bit is set, an interrupt is signalled 




devsel_l tim- 
ing 


These read only bits are set to 01 , to indicate tliat 
the coprocessor responds asserts its devsel_l 
with medium speed when it is a target. 


1! 


Signalled Target 
Abort 


Reserved. 


12 


Received Target 
Abort 


This bit is set whenever the coprocessor, as a mas- 
ter, receives a target abort It causes the EIC to 
generate an interrupt, and to stop its operation 
until the bit is cleared by software. 


13 


Received Master 
Abort 


This bit is set whenever the coprocessor, as a mas- 
ter, executes a master abort. )l causes the EIC to 
generate an interrupt, and to stop its operation 
until the bit is cleared by software. 


14 


Signalled System 
Error 


This bit is ser whenever the coprocessor asserts 
serr_jL. Tliis will only occur on an address par- 
ky error. 


15 


Detected Parity 


This bit is set whenever the coprocessor detects an 
address or data parity error. 



ek. Revision ID 

This is a read-only register, the coprocessor's initial revision ID is 0x01 . 
el. Class Code 

This is a read-only register, the coprocessor doc* not fit iruo any of the defined class codes of 
the PCI SIG, so this register is set to OxFFOOOO. 

em. Cache Line Size 

This is areaoVwrite register that specifies the cache line size of the systemln 32 bit words. It 
determines when the coprocessor will use the Memory Read Line and Memory Read Multi- 
ple commands, the coprocessor supports any value from 0 to 255 in this register. A value of 
zero in this register disables the Memory Read Line and Memory Read Multiple command 
types. It is set to 0x00 on reset 

en. Latency Timer 

This is a read/write register that specifies the largest number of clocks the coprocessor can 
use for any PCI transaction the coprocessor supports any value in this register from 0 to 
255. It is set lo 0x00 on reset 

eo. Header Type 

This read-only register is set to 0x00, indicating that the coprocessor uses a type 0 layout for 
the configurctirm space. 

ep. Base Address 

Tliis read/write register is used to Innate the coprocessor's internal registers and memories, 
the local memory and the Generic Interface in the host's memory map. the coprocessor's 
various resources occupy 64 MBytes (not all locations are used), so only the top 6 bits of this 
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register are writable. The reraainin? address bits are all hardwired 10 acio. The lower four 
bits of tliis register are read-only control bits, which are also nnrdwircd 10 0. This indicates 
that the register refers to memory space, thai the coprocessor can be rnarjped anywhere in the 
32-bii address space of the host, and that the coprocessor resources are not prefe'.chabie 
when ii is a target- 

eq. Subsystem Vendor ID 

This iead-uuly register allows the host to identify the vendor of a PCI board plugged into the 
system (as opposed to the vendor of the component implementing ihe PCI interface on the 
hoard). THe contents of this register are loaded on reset via the serial configuration port on 
the E1C, 

er. Subsystem ID 

The read-only register allows the host to identify a PCI board plugged into the system. The 
contents of this register axe loaded on reset viache serial configu rati on port on the EIC This 
mechanism allows any required information about the board functionality or configuration 
to be encoded externally and read by the host. 

es. Interrupt Una 

This reaoVwrite register is used by the system software to record interrupt line routing infor- 
mation so that it is accessible to interrupt service software. It has no effect on any operations 
in the coprocessor. It is set to 0*00 on reset. 

bL Interrupt Pin 

This read-only register is hard-wired to OxOI , indtcaung that the coprocessor drives PCI 
interrupt pin 1 n ta_l. 

eu. Min_Gnt 

i 

This read only register indicates to the system the coprocessor's desired burst period len*tfi, 
in units of 1/4 microseconds. The opdmum value for this register has not yet been deter-*' 
mined. 

ev. Max_Lat 

This readonly register indicates to the system die coprocessor's desired maximum latency in 
gaining control of the PCI bus after a request, in units of 1/4 microseconds. The optimum 
value for this register has not yet been determined. 

1 .1.4 Interna! Memory Map 

This section details the objects that appear in the per-module data areas within the the 
coprocessor internal memory map. 



Table 1.144 the coprocessor internal Memory Map 









1MB 




0x0000- 
OxlFFF 


reserved 


reserved 






OxEOOO- 
0xR03K 


EIC 


eic_ptp 




MMU page taWe pointers 


0x8040- 
Ox807F 


reserved 


reserved 
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Table 1.144 the coprocessor Internal Memory Map 



j > base \; 






»««««*-'- : 




:'• * \ ' l' V - .7'" 


' r - ;- : " 
Associated 




4| : i(r^; .,?»^ 
SSa^^.:. "I" 


v TvO£ ' 


i ■ Dof trillion l u , > - . * > 


0x8080- 
OxBOFF 


EIC 


cic_addr 


Read 
Only 


CAM data widi vinual and physi- 
cal napp numbers intprlpavp-rl 


0x8100- 
OxFFFF 


reserved 


reserved 






Ox 10000- 
OxlFFFF 


LMC 


reserved 






0x20000- 
OX2007F 


PIC 




R/W 


32 word x32 bit RAM in PIC input 
datapath 


Ox2O0S0- 
0x200FF 


PIC 


output RAM 


R/W 


32 word x32 bit RAM in PIC out- 
put datapath 


0x20100 
0x2FFFF 


PIC 


reserved 






0x30000- 
Ox3FFFF 


MISC 


reserved 






0x40000- 
40017 


IC 


prefetch_buf 


read 
only 


Contents of Prefetch Buffer 


UX4U0 1 o- 

0x4FFFF 


IC 


reserved 






0x50000 


DCC 


cache RAM 


R'W 


4k byte of 32 bit wide memory 


0x51000 


DCC 


tag RAM 


R/W 


128 x 20 bit tag memory - the Most 
significani 20 bits of the 32 bit 
word arc used 


0x52000 


DCC 


dcc_odatabO 


R/W 


bits 3 1:0 of o_datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52004 


DCC 


dcc_odatabi 


K/W 


biis 63:32 of o. datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52008 


DCC 


dcc_odatab2 


R/W 


bits 95:64 of o_da tab bus. This 
value is driven on to the bus in the 
test mode. 


Ox5200c 


DCC 


dcc_odatab3 


R/W 


bits 127:96 of o_datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52010 


IXC 


dcc^odatacO 




ojis Ji.uoi o__aat:ac dus. irus 
value is driven on to the bus in the 
test mode. 


0x52014 


DCC 


dcc_odatacl 


R/W 


bits 63:32 of o_datac bus. This 
value is driven on to the bus in the 
test mode. 


0x52019 


DCC 


dccodatac2 


R/W 


bits 95:64 of o_datac bus. This 
value is driven on to the bus in the 
test mode. 
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Table 1.144 



2tl 

the coprocessor Internal Memory Map 
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1-1.5 Memory Word Fields 
a. eicjtp 

Table 1.1 45 eicjrtp Memory Word Fields 





./.Name 


, ; ' Definition " . .,.r 


11:0 


Reserved 




31:12 


ptp 


Top 20 bits of physical byte address of the bottom 
of a 4kB segment of the page table. 
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4. BR1HF DESCRIPTION OF THE DRAWINGS 

Notwithstanding any other forms which may fall within the scope of the present 
invention, preferred forms of the invention will now be described, by way of example 
5 only, with reference to the accompanying drawings: 

Fig. 1 illustrates the operation of a raster image co-processor within a host 
computer environment; 

Fig. 2 illustrates the raster image co-processor of Fig. 1 in further detail; 
Fig. 3 illustrates the memory map of the raster image co-processor; 
10 Fig. 4 shows the relationship between a CPU, instruction queue, instruction 

operands and results in shared memory, and a co-processor; 

Fig. 5 shows the relationship between an instruction generator, memory manager, 
queue manager and co-processor; 

Fig. 6 shows the operation of the graphics co-processor reading instructions for 
15 execution from the pending instruction queue and placing them on the completed 
instruction queue; 

Fig. 7 shows a fixed length circular buffer implementation of the instruction 
queue, indicating the need to wait when the buffer fills; 

Fig. 8 illustrates to instruction execution streams as utilized by the co-processor; 
20 Fig. 9 illustrates an instruction execution flow chart; 

Fig. 10 illustrates the standard instruction word format utilized by the co- 
processor; 

Fig. 11 illustrates the instruction word fields of a standard instruction; 

Fig, 12 illustrates the data word fields of a standard instruction; 
25 Fig. 13 illustrates schematically the instruction controller of Fig. 2; 

Fig, 14 illustrates the execution controller of Fig. 13 in more detail; 

Fig. 15 illustrates a state transition diagram of the instruction controller; 

Fig. 16 illustrates the instruction decoder of Fig. 13; 

Fig. 17 illustrates the instruction sequencer of Fig. 16 in more detail; 
30 Fig. 18 illustrates a transition diagram for the ID sequencer of Fig. 16; 

Fig, 19 illustrates schematically the prefetch buffer controller of Fig. 13 in more 

detail; 



-606- 



-265- 

Fig. 20 illustrates the standard form of register storage and module interaction as 
utilized in the co-processor; 

Fig. 21 illustrates the format of control bus transactions as utilized in the co- 
processor; 

Fig. 22 illustrates the data flow through a portion of the co-processor; 
Figs. 23-29 illustrate various examples of data reformatting as utilized in the co- 
processor; 

Figs. 30 and 31 illustrate the format conversions carried out by the co-processor; 
Fig. 32 illustrates the process of input data transformation as carried out in the 
co-processor; 

Figs. 33-41 illustrate various further data transformations as carried out by the 
co-processor; 

Fig. 42 illustrates various internal to output data transformations carried out by 
the co-processor; 

Figs. 43-47 illustrate various further example data transformations carried out by 
the co-proccssor; 

Fig. 48 illustrates various fields utilized by internal registers to determine what 
data transformations should be carried out; 

Fig. 49 depicts a block diagram of a graphics subsystem that uses data 
normalization.; 

Fig. 50 illustrates a circuit diagram of a data normalization apparatus; 
Fig. 51 illustrates the pixel processing carried out for compositing operations; 
Fig. 52 illustrates the instruction word format for compositing operations; 
Fig. 53 illustrates the data word format for compositing operations; 
Fig. 54 illustrates the instruction word format for tiling operations; 
Fig. 55 illustrates the operation of a tiling instruction on an image; 
Fig. 56 illustrates the process of utilization of interval and fractional tables to re- 
map color gamuts; 

Fig. 57 illustrates the form of storage of interval and fractional tables within the 
MUV buffer of the co-processor; 

Fig. 58 illustrates the process of color conversion utilising interpolation as 
carried out in the co-processor; 
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Fig. 59 illustrates the refinements to the rest of the color conversion process al 
gamut edges as carried out by the co-processor; * 

Fig. 60 illustrates the process of color space conversion for one output color as 
implemented in the co-processor; 

Fig. 61 illustrates the memory storage within a cache of the co-processor when 
utilising single color output color space conversion; 

Fig. 62 illustrates the methodology utilized for multiple color space conversion; 

Fig. 63 illustrates the process of address re-mapping for the cache when utilized 
during the process of multiple color space conversion; 

Fig. 64. illustrates the instruction word format for color space conversion 
instructions; 

Fig. 65 illustrates a method of multiple color conversion; 

Fig. 66 and 67 illustrate the formation of MCLTs during the process of JPEG 
conversion as carried out in the co-processor; 

Fig. 68 illustrates the structure of the JPEG coder of the co-processor; 

Fig. 69 illustrates the quantizer portion of Fig. 68 in more detail; 

Fig. 70 illustrates the Huffman coder of Fig- 68 in more detail; 

Figs. 71 and 72 illustrate the Huffman coder and decoder in more detail; 

Figs. 73-75 illustrate the process of cutting and limiting of JPEG data as utilized 
in the co-processor; 

Fig. 76 illustrates the instruction word format for JPEG instructions; 

Fig. 77 shows a block diagram of a typical discrete cosine transform apparatus 
(prior art); 

Fig, 78 illustrates an arithmetic data path of a prior art DCT apparatus; 

Fig. 79 shows a block diagram of a DCT apparatus utilized in the co-processor; 

Fig. 80 depicts a block diagram of the arithmetic circuit of Fig. 79 in more detail; 

Fig. 81 illustrates an arithmetic data path of the DCT apparatus of Fig. 79; 

Fig. 82 presents a representational stream of Huffman-encoded data units 
interleaved with not encoded bit fields, both byte aligned and not, as in JPEG format; 

Fig, 83 illustrates the overall architecture of a Huffman decoder of JPEG data of 
Fig. 84 in more detail; 

Fig. 84 illustrates the overall architecture of the Huffman decoder of JPEG data; 
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Fig. 85 illustrates data processing in the stripper block which removes byte 
aligned not encoded bit fields from the input data. Examples of the coding of tags 
corresponding to the data outputted by the stripper are also shown; 

Fig. 86 shows the organization and the data flow in the data preshifter; 
5 Fig. 87 shows control logic for the decoder of Fig. 81; 

Fig. 88 shows the organization and the data flow in the marker preshifter; 

Fig. 89 shows a block diagram of a combinatorial unit decoding Huffman 
encoded values in JPEG context; 

Fig. 90 illustrates the concept of a padding zone and a block diagram of the 
10 decoder of padding bits; 

Fig. 91 shows an example of a format of data outputted by the decoder, the 
format being used in the co-processor; 

Fig. 92 illustrates methodology utilized in image transformation instructions; 

Fig. 93 illustrates the instruction word format for image transformation 
15 instructions; 

Figs 94 and 95 illustrate the format of an image transformation kernal as utilized 
in the co-processor; 

Fig. 96 illustrates the process of utilising an index table for image 
transformations as utilized in the co-processor; 
20 Fig. 97 illustrates the data field format for instructions utilising trausformations 

and convolutions; 

Fig. 98 illustrates the process of interpretation of the bp field of instruction 

words; 

Fig. 99 illustrates the process of convolution as utilized in the co-processor; 
25 Fig. 100 illustrates the instruction word format for convolution instructions as 

utilized in the coprocessor; 

Fig. 101 illustrates the instruction word format for matrix multiplication as 
utilized in the co-processor; 

Figs 102-105 illustrates the process utilized for hierarchia! image manipulation 
30 as utilized in the co-processor; 

Fig. 106 illustrates the instruction word coding for hierarchial image instructions; 

Fig. 107 illustrates the instruction word coding for flow control instructions as 
illustrated in the co-processor; 
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Fig, 108 illustrates the pixel organizer in more detail; 
Fig. 109 illustrates the operand fetch unit of the pixel organizer in more detail; 
Figs. 110-114 illustrate various storage formats as utilized by the co-processor; 
Fig. 115 illustrates the MUV address generator of the pixel organizer of the co- 
5 processor in more detail; 

Fig. 1 16 is a block diagram of a multiple value (MUV) buffer utilized in the co- 
processor; 

Fig. 117 illustrates a structure of the encoder of Fig. 116; 
Fig. 118 illustrates a structure of the decoder of Fig. 11G; 
10 Fig. 119 illustrates a structure of an address generator of Fig. 1 1 6 for generating 

read addresses when in JPEG mode (pixel decomposition); 

Fig. 120 illustrates a structure of an address generator of Fig. 116 for generating 
read addresses when in JPEG mode (pixel reconstruction); 

Fig. 121 illustrates aa organization of memory modules comprising the slUTagc 
1 5 device of Fig 116; 

Fig. 122 illustrates a structure of a circuit that multiplexes read addresses to 
memory modules; 

Fig. 123 illustrates a representation of how lookup table entries are stored in the 
buffer operating in a single lookup table mode; 
20 Fig. 124 illustrates a representation of how lookup table entries are stored in the 

buffer operating in a multiple lookup table mode; 

Fig. 125 illustrates a representation of how pixels are stored in the bullet 
operating in JPEG mode (pixel decomposition); 

Fig. 126 illustrate a representation of how single color data blocks are retrieved 
25 Tram the buTfer operating in JPEG mode (pixel reconstruction); 

Fig. 127 illustrates the structure of the result organizer of the co-processor in 
more detail; 

Fig. 128 illustrates the structure of the operand organizers of the co-processor in 
more detail ; 

30 Fig. 129 is a block diagram of a computer architecture for the main data path unit 

utilized in the cu-processoi; 

Fig. 130 is a block diagram of a input interface for accepting, storing and 
rearranging input data objects for further processing; 
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Fig. 131 is a block diagram of a image data processor for performing arithmetic 
operations on incoming data objects; 

Fig. 132 is a block diagram of a color channel processor for performing 
arithmetic operations on one channel of the incoming data objects; 

Fig. 133 is a block diagram of a multifunction block in a color channel processor; 

Fig. 134 illustrates a block diagram for compositing operations; 

Fig. 135 shows an inverse transform of the scanline; 

Fig. 136 shows a block diagram of the steps required to calculate the value for a 
designation pixel: 

Fig, 137 illustrates a block diagram of the image transformation engine; 
Fig. 138 illustrates the two formats of kernel descriptions; 
Fig, 139 shows the definition and interpretation of a bp field; 
Fig. 140 shows a block diagram of multiplier-adders that perform matrix 
multiplication; 

Fig. 141 itlustiales ihe control, address and data flow of the cache and cache 
controller of the co-processor; 

Fig. 142 illustrates the memory organization of the cache; 

Fig. 143 illustrates the address format for the cache controller of the co- 
processor; 

Fig. 144 is a block diagram of a multifunction block in a color channel processor, 

Fig. 145 illustrates the input interface switch of the co-processor in more Fig. 
144 illustrates, a block diagram of the cache and cache controller; 

Fig. 146 illustrates a four-port dynamic local memory controller of the co- 
processor showing the main address and data paths; 

Fig. 147 illustrates a state machine diagram for the controller of Fig. 146; 

Fig. 148 is a pseudo code listing detailing the function of the arbitrator of Fig. 

146; 

Fig. 149 depicts the structure of the requester priority bits and the terminology 
used in Fig. 146. 

Fig. 150 illustrates the external interface controller of the co-processor in more 

detail; 

Figs. 151-154 illustrate the process of virtual to/from physical address mapping 
as utilized by the co-processor; 
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Fig. 155 illustrates the IBus receiver unit of Fig. 150 in more detail; 

Fig. 156 illustrates the RBus receiver unit of Fig. 2 in more detail; 

Fig. 157 illustrates the memory management unit of Fig. 150 in more detail; 

Fig. 158 illustrates the peripheral interface controller of Fig. 2 in more detail. 
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